Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3444 |
Symbol | |
ID | 8727197 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4175872 |
End bp | 4177179 |
Gene Length | 1308 bp |
Protein Length | 435 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | |
Product | para-aminobenzoate synthase, subunit I |
Protein accession | YP_003388251 |
Protein GI | 284038321 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.716101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 49 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTCT ATGTGCCGAT TGAAAACCCG CTGGTATGGC GTTGGCAGGC CTTAGCCTGG GCTATCAGTC AGGAAGCACA TTCGGATGGC TTTGTTGCCT TCCTGAACAA CAACGAGACC ACTTACCCTA ATGGCCCGTT TCCCAACCGA CTTTTTGTTG GTTCGAAACG GGTTGTTTCT TTTTCAGATG TAGCTGGCCC GGACGCCTTC CGGGTGTTGG AGAAGGCGCA CCGTGATGAG CCGTCATTTT TGGTCGGCTA CTTCGGCTAT GATCTGAAAA ACCAGCTGGA GGCATTGAGC AGCCGTAACG CTCCCCGACT TGAGTTGCCC GATGTTTATT TTGTGGAGCC GGAATGGGTA ATTGACTTTA CGACAGACAA TACCGTCGTT ATCCACGGAG CGGGCAACAT CGATCAACTG CTCAAAGAGA TAGCTTCTTA CGACAGCGCT CAGGTATTGA GCACCCCAAA AAGAACGCCC GTTTCCGTTC AGTGCCGCGT TACGCCAGCA GAGTATCAGG CTACCGTTCG CCAAATCAAA GAGCACATTG TAGCAGGGGA TGTGTATGAG TTAAATTACT GCATAGAGTT TTTTGCCGAA CAGGCTCAAC TCAATCCGCT GACAACGTAT CAGGCGTTGA ATGAGCGGTC GCCAATGCCT TTTTCGAACT TTATCAAGCT GGGTGATCAG TACATTATAG GAGCATCGCC CGAGCGGTTT TTGAAAAAAG AGGCCAGCCG CCTTGTAACC CAGCCCATCA AAGGCACCAT CCGGCGTGGA AAAACGCCCG ACGAAGATGC TCGTCTGCGC AATCAGCTTA TTAATTCAGA AAAGGAACGG GCCGAAAATC TGATGATCGT CGATCTGGTC AGGAACGACC TGGCCCGGAG TGCCGTAACG GGTAGTGTTC GCGTGGATGA ACTGTTTGGT ATTTATGGCT TTCGGCAAGT GTATCAGCTG ATTTCTACCG TTTCTGCCAC ACTGCGGGAT AGCGTTTCGT GGGCCGATGC ACTGCGTCAG GCGTTTCCAA TGGGCAGTAT GACGGGAGCC CCCAAGATTC GGGCCATGCA ACTTATTGAC GAACTGGAAG TGAGCAGGCG GGGAGTTTAC TCAGGCGCGG TAGGTTTTGT AACACCAGAA GGCGATTTTG ATTTTAGTGT GGTGATCCGG ACATTACTGT ACGATGCCCG GCAACAGTAT GCTTCTTTTT CGGTTGGTAG CGCCATCACC TACGATGCCG ATCCGGCGCA GGAGTGGGAG GAATGTTTAC TGAAAGCAAG CGCCATCCGG CAGGTTCTGG AGTCATAA
|
Protein sequence | MAVYVPIENP LVWRWQALAW AISQEAHSDG FVAFLNNNET TYPNGPFPNR LFVGSKRVVS FSDVAGPDAF RVLEKAHRDE PSFLVGYFGY DLKNQLEALS SRNAPRLELP DVYFVEPEWV IDFTTDNTVV IHGAGNIDQL LKEIASYDSA QVLSTPKRTP VSVQCRVTPA EYQATVRQIK EHIVAGDVYE LNYCIEFFAE QAQLNPLTTY QALNERSPMP FSNFIKLGDQ YIIGASPERF LKKEASRLVT QPIKGTIRRG KTPDEDARLR NQLINSEKER AENLMIVDLV RNDLARSAVT GSVRVDELFG IYGFRQVYQL ISTVSATLRD SVSWADALRQ AFPMGSMTGA PKIRAMQLID ELEVSRRGVY SGAVGFVTPE GDFDFSVVIR TLLYDARQQY ASFSVGSAIT YDADPAQEWE ECLLKASAIR QVLES
|
| |