Gene Information |
Locus tag | BAS5206 |
Symbol | |
ID | 2848219 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | + |
Start bp | 5090870 |
End bp | 5094178 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637508461 |
Product | collagen adhesion protein |
Protein accession | YP_031445 |
Protein GI | 49188192 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | |
|
|
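The coordinate, gene length and protein length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table for locus BAS5206.
START_BP = 5090870
END_BP = 5094178
GENE_LENGTH_BP = 3309
PROTEIN_LENGTH_AA = 1102


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 5094178 - 5090870 + 1 = 3309 bp, and 3309 / 3 - 1 = 1102 aa.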
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCCAAA GACCCAAACA ATTGAATGGA CGATTACTTA CAACGGTGAT CAAAGAAATA TCAAAAAAAA CAGATGCACT TTTAAAAGAT ATTTTTGACG ATACACATGA ATTAGATGTA AATTCTATCG TTGTGAAAAA CGCTTCATAC GACGATAAAG GAAGGCTTGT AACAGGAGAC ACTGTTAATA ATTACACTGT AACTAACAAG AAGAATGGAT TCGACTTACA GTTTAATGAA GATATTAATA GTGCGTATGT AATTACCTAT AAAACAAAGC CAACTAATAA TGTCATAAAA GATGGAAAAA TAAAGAATAC AGTTACAGCT GATAATGGCT CAAGTAAAGA AAATGAGGCT GGTTTCCAGC AGCAAAATAT TATTAAATCT AATAATAAAG CTGAAACAAA CTATAAAGAC AAAACAACAA CCTGGACAAT TACGGTAAAT AATAACAACT ATCCATTAAA CAACGCGATC ATTACGGATA CCTTTGACCA CGGTGGATTA CAATTAAAAG ATAAAAAACT AGAAATTAAA GACGGAGATT ATACCCTTCA GGCTGAGACT GATTATGTTT TAGATGTAAC AGATAAAGGT TTCAAAATTA CCCTTATAGG TACATATCAG TCTAATATGA CAAAGACATT AATCGTAAAA TATACGACAG ACTTTGACTA TACAAAACTA GAAAGTGGTA AAACTTCATT TAAAAATACA GGTAACCTAT CTTGGATAGA TGCAGGGTCC AATCCACAAT CAAATAAAGT TGAAGCAGAC TTTGATCCTG ATACTTTCAC AAAGGCAAAT GGCTATAAAT ACGGTTCTTA TAACGCCCAA ACGAAAGAAA TCACTTGGAT AATAGGTTTT AATTATAATA ATGTTGAGAT TAAAGATCCC TATGTTATAG ACGTAATACA AGATAAACAA AAGTTAGTAC CAGGATCCAT TGAAGTGCGC CATATGATTT TAAATGGAAG TCCGGATAAT GCAAGACCTG GTGATGCTGT ACCAATTGAG CAGTATGAAC TTGAAGAACC TACAGATAAA AATAAAAACA CCCTACAGGT TCATTTCAAG CAATCAATTA ATTCACCTTA CTATATTATC TTTAAAACAA GCCTTGATGG TGAACTTATC CAAAACACCT ACAAAAACGA GGCAGAGTTA AAAGATGGCT CTAAAATTGT AAACACTCTT AAAGGTGACG CTCAAGTAAA TAAAGGTGGC AGTTTCGTTA CTAAAAAAGC GGTGCAAGAT GACAACTATA TTAATTGGAG TATTGCAATT AACGAAAGCC AATCGACCAT TGCAGACGCA GTTGTAACAG ACGATCCAAC AGACAATCAA GTACTTGTGG AAGATTCATT CCACTTATAT CCTACAACTG TCGATTACTA TGGAAATGTA ACAAAAGATA CAGCAAATGA ATTAAAACAA GGAACAGACT ATAAGTTAAC GATCACAACA GACAATAATA CCGGAAAGCA ACATTTCGAA ATTGCCTTCT TGAAAAAAAT TGATCGAGCT TATATTTTAG AATATCGCTC ACTTATTAAT GCAGACGATA AAGAAAAAGT GAGTAATAAA GCAAAAATCG CTGGCAATCA GTTAACAGTT AAGAATACAG AAACTGTTGA AACGATTGAA GTGAGAATGT CTTCCGGTTC AGGTGGAGGA TCTGCTACTA ATGGGCGCGG AAACCTTGAA ATTATAAAGG TAGATAACGA CAATAAGAAA GTACCATTGT CCGGAGCAGA ATTTACTTTA TATGATCGTA CAGGAAAAAC CGTTATCCGC 
AAAATAACAA CAGACAAAGA TGGTATTGCT AAGTTTAATA ACTTAAAGCG TGATAAATAT TTATTAAAAG AAACAAAAGC TCCTGAGGGC TATGTAATTA GCTGGGATTT AAAACAGGGA AAAATTGTTG AACTTGGTTC ACAAGAGACT ACAACATATA AACTTGCAAA TAAGAAATTT GTCGGTAAAG TAGTTTTAAC AAAGTCTGAT GACCTGAATA AAAACGTAAC ACTGCAAGGT GCCGTTTTCA CACTACTTGA TAAAGATAAA AAAATAATTT CAGAACACGA AAAATTAACA ACAAATGATC AAGGACAAAT TACTGTAGAT AATTTAAAAC CAGGAACTTA CTACTTACAA GAAACAACCG CTCCTGAGCA CTATAAATTA GACAGTACAC CTATTCAATT CACAATCAAA GAAGATCAAA CAACAGTAAT AAACCGAACT GCAACAAACA GTTTGATTCC AGGATCGGCC ATTTTAACAA AAGTCGATAA AGATGGAAAA ACTTTAGCAG GTGCGGAGTT TAGTGTACGC GATCGACATA ACAATGTAAT TCGTGGATAT GAAAAATTAA CGACAAATGA TCAAGGACAA ATCGAAGCAA CAAATTTACG TCCTGGAGAC TATCAATTTG TTGAAGAAAA AGCTCCTAAA GATTACGACA TAGATAAAAC ACCAATTGAG TTTACAATTG TAAAAAGTCA AAAGAAAGCA GTTACTGTTA CTGCGACAAA TCACCTCATT AAAGGTGGCG TCACTTTAAC AAAAACTGAT GATATCGATG GTACAGCTCT TGCAGGTGCT ATATTTAAAA TTGTCGATGC TAATGATGAA AAGAAAGTCA TTCGCGAAAA TGTAAAAACT GGTGCGGATG GTAAAGTAAC TGTTAAAGAC TTGGAGCCTG GTACGTATAA ATTTATTGAA ACAGAAGCTC CTAAGGATTA TGTGTTAAAT GCAAATCCTA TCGAGTTTAC AATCGATAAA AGCCAACAGT CTTTTGCTAC TGTTACGGCA ACAAACAGTT TAAAAACAGG AGAAGTTGAA TTATTAAAAG TTGATGAATT CGGTGATAAA AAACCTCTAA AAGGTGCCGT ATTTAAAATT GTTGATGTAA ATAATAATGA TGTTCGTACT GATCTAACTA CAGATGCTGA TGGAAAAACA AAAGCTGATA AGCTACGTCC TGGTACGTAT AAATTTATTG AAACAGCTGC TCCTGAACAT TACGTTTTAC GAGCAGAACC TATTGAATTT ACAATCGATA GAAGTCAGAA AGAAACCCTA CTTGTGAAAG CTGAAAATGC TTTAAAACCA GGCGATGTTG AATTAACGAA AGTTGATGAT ATCGATGGTA CAGCTCTTGC AGGTGCTGTA TTTAAAATTG TTGATGCTAA TGATGAAAAG AAAGTCATTC GCGAAAATGT AAAAACTGGT GCGGATGGTA AAGCTATCGC TACTGGCTTA CGCCCTGGCA ATTATAAATT TATCGAAGTA TTACGATAA
|
Protein sequence | MIQRPKQLNG RLLTTVIKEI SKKTDALLKD IFDDTHELDV NSIVVKNASY DDKGRLVTGD TVNNYTVTNK KNGFDLQFNE DINSAYVITY KTKPTNNVIK DGKIKNTVTA DNGSSKENEA GFQQQNIIKS NNKAETNYKD KTTTWTITVN NNNYPLNNAI ITDTFDHGGL QLKDKKLEIK DGDYTLQAET DYVLDVTDKG FKITLIGTYQ SNMTKTLIVK YTTDFDYTKL ESGKTSFKNT GNLSWIDAGS NPQSNKVEAD FDPDTFTKAN GYKYGSYNAQ TKEITWIIGF NYNNVEIKDP YVIDVIQDKQ KLVPGSIEVR HMILNGSPDN ARPGDAVPIE QYELEEPTDK NKNTLQVHFK QSINSPYYII FKTSLDGELI QNTYKNEAEL KDGSKIVNTL KGDAQVNKGG SFVTKKAVQD DNYINWSIAI NESQSTIADA VVTDDPTDNQ VLVEDSFHLY PTTVDYYGNV TKDTANELKQ GTDYKLTITT DNNTGKQHFE IAFLKKIDRA YILEYRSLIN ADDKEKVSNK AKIAGNQLTV KNTETVETIE VRMSSGSGGG SATNGRGNLE IIKVDNDNKK VPLSGAEFTL YDRTGKTVIR KITTDKDGIA KFNNLKRDKY LLKETKAPEG YVISWDLKQG KIVELGSQET TTYKLANKKF VGKVVLTKSD DLNKNVTLQG AVFTLLDKDK KIISEHEKLT TNDQGQITVD NLKPGTYYLQ ETTAPEHYKL DSTPIQFTIK EDQTTVINRT ATNSLIPGSA ILTKVDKDGK TLAGAEFSVR DRHNNVIRGY EKLTTNDQGQ IEATNLRPGD YQFVEEKAPK DYDIDKTPIE FTIVKSQKKA VTVTATNHLI KGGVTLTKTD DIDGTALAGA IFKIVDANDE KKVIRENVKT GADGKVTVKD LEPGTYKFIE TEAPKDYVLN ANPIEFTIDK SQQSFATVTA TNSLKTGEVE LLKVDEFGDK KPLKGAVFKI VDVNNNDVRT DLTTDADGKT KADKLRPGTY KFIETAAPEH YVLRAEPIEF TIDRSQKETL LVKAENALKP GDVELTKVDD IDGTALAGAV FKIVDANDEK KVIRENVKTG ADGKAIATGL RPGNYKFIEV LR
|
| |