Gene BAS5206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5206 
Symbol 
ID2848219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5090870 
End bp5094178 
Gene Length3309 bp 
Protein Length1102 aa 
Translation table11 
GC content34% 
IMG OID637508461 
Productcollagen adhesion protein 
Protein accessionYP_031445 
Protein GI49188192 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCAAA GACCCAAACA ATTGAATGGA CGATTACTTA CAACGGTGAT CAAAGAAATA 
TCAAAAAAAA CAGATGCACT TTTAAAAGAT ATTTTTGACG ATACACATGA ATTAGATGTA
AATTCTATCG TTGTGAAAAA CGCTTCATAC GACGATAAAG GAAGGCTTGT AACAGGAGAC
ACTGTTAATA ATTACACTGT AACTAACAAG AAGAATGGAT TCGACTTACA GTTTAATGAA
GATATTAATA GTGCGTATGT AATTACCTAT AAAACAAAGC CAACTAATAA TGTCATAAAA
GATGGAAAAA TAAAGAATAC AGTTACAGCT GATAATGGCT CAAGTAAAGA AAATGAGGCT
GGTTTCCAGC AGCAAAATAT TATTAAATCT AATAATAAAG CTGAAACAAA CTATAAAGAC
AAAACAACAA CCTGGACAAT TACGGTAAAT AATAACAACT ATCCATTAAA CAACGCGATC
ATTACGGATA CCTTTGACCA CGGTGGATTA CAATTAAAAG ATAAAAAACT AGAAATTAAA
GACGGAGATT ATACCCTTCA GGCTGAGACT GATTATGTTT TAGATGTAAC AGATAAAGGT
TTCAAAATTA CCCTTATAGG TACATATCAG TCTAATATGA CAAAGACATT AATCGTAAAA
TATACGACAG ACTTTGACTA TACAAAACTA GAAAGTGGTA AAACTTCATT TAAAAATACA
GGTAACCTAT CTTGGATAGA TGCAGGGTCC AATCCACAAT CAAATAAAGT TGAAGCAGAC
TTTGATCCTG ATACTTTCAC AAAGGCAAAT GGCTATAAAT ACGGTTCTTA TAACGCCCAA
ACGAAAGAAA TCACTTGGAT AATAGGTTTT AATTATAATA ATGTTGAGAT TAAAGATCCC
TATGTTATAG ACGTAATACA AGATAAACAA AAGTTAGTAC CAGGATCCAT TGAAGTGCGC
CATATGATTT TAAATGGAAG TCCGGATAAT GCAAGACCTG GTGATGCTGT ACCAATTGAG
CAGTATGAAC TTGAAGAACC TACAGATAAA AATAAAAACA CCCTACAGGT TCATTTCAAG
CAATCAATTA ATTCACCTTA CTATATTATC TTTAAAACAA GCCTTGATGG TGAACTTATC
CAAAACACCT ACAAAAACGA GGCAGAGTTA AAAGATGGCT CTAAAATTGT AAACACTCTT
AAAGGTGACG CTCAAGTAAA TAAAGGTGGC AGTTTCGTTA CTAAAAAAGC GGTGCAAGAT
GACAACTATA TTAATTGGAG TATTGCAATT AACGAAAGCC AATCGACCAT TGCAGACGCA
GTTGTAACAG ACGATCCAAC AGACAATCAA GTACTTGTGG AAGATTCATT CCACTTATAT
CCTACAACTG TCGATTACTA TGGAAATGTA ACAAAAGATA CAGCAAATGA ATTAAAACAA
GGAACAGACT ATAAGTTAAC GATCACAACA GACAATAATA CCGGAAAGCA ACATTTCGAA
ATTGCCTTCT TGAAAAAAAT TGATCGAGCT TATATTTTAG AATATCGCTC ACTTATTAAT
GCAGACGATA AAGAAAAAGT GAGTAATAAA GCAAAAATCG CTGGCAATCA GTTAACAGTT
AAGAATACAG AAACTGTTGA AACGATTGAA GTGAGAATGT CTTCCGGTTC AGGTGGAGGA
TCTGCTACTA ATGGGCGCGG AAACCTTGAA ATTATAAAGG TAGATAACGA CAATAAGAAA
GTACCATTGT CCGGAGCAGA ATTTACTTTA TATGATCGTA CAGGAAAAAC CGTTATCCGC
AAAATAACAA CAGACAAAGA TGGTATTGCT AAGTTTAATA ACTTAAAGCG TGATAAATAT
TTATTAAAAG AAACAAAAGC TCCTGAGGGC TATGTAATTA GCTGGGATTT AAAACAGGGA
AAAATTGTTG AACTTGGTTC ACAAGAGACT ACAACATATA AACTTGCAAA TAAGAAATTT
GTCGGTAAAG TAGTTTTAAC AAAGTCTGAT GACCTGAATA AAAACGTAAC ACTGCAAGGT
GCCGTTTTCA CACTACTTGA TAAAGATAAA AAAATAATTT CAGAACACGA AAAATTAACA
ACAAATGATC AAGGACAAAT TACTGTAGAT AATTTAAAAC CAGGAACTTA CTACTTACAA
GAAACAACCG CTCCTGAGCA CTATAAATTA GACAGTACAC CTATTCAATT CACAATCAAA
GAAGATCAAA CAACAGTAAT AAACCGAACT GCAACAAACA GTTTGATTCC AGGATCGGCC
ATTTTAACAA AAGTCGATAA AGATGGAAAA ACTTTAGCAG GTGCGGAGTT TAGTGTACGC
GATCGACATA ACAATGTAAT TCGTGGATAT GAAAAATTAA CGACAAATGA TCAAGGACAA
ATCGAAGCAA CAAATTTACG TCCTGGAGAC TATCAATTTG TTGAAGAAAA AGCTCCTAAA
GATTACGACA TAGATAAAAC ACCAATTGAG TTTACAATTG TAAAAAGTCA AAAGAAAGCA
GTTACTGTTA CTGCGACAAA TCACCTCATT AAAGGTGGCG TCACTTTAAC AAAAACTGAT
GATATCGATG GTACAGCTCT TGCAGGTGCT ATATTTAAAA TTGTCGATGC TAATGATGAA
AAGAAAGTCA TTCGCGAAAA TGTAAAAACT GGTGCGGATG GTAAAGTAAC TGTTAAAGAC
TTGGAGCCTG GTACGTATAA ATTTATTGAA ACAGAAGCTC CTAAGGATTA TGTGTTAAAT
GCAAATCCTA TCGAGTTTAC AATCGATAAA AGCCAACAGT CTTTTGCTAC TGTTACGGCA
ACAAACAGTT TAAAAACAGG AGAAGTTGAA TTATTAAAAG TTGATGAATT CGGTGATAAA
AAACCTCTAA AAGGTGCCGT ATTTAAAATT GTTGATGTAA ATAATAATGA TGTTCGTACT
GATCTAACTA CAGATGCTGA TGGAAAAACA AAAGCTGATA AGCTACGTCC TGGTACGTAT
AAATTTATTG AAACAGCTGC TCCTGAACAT TACGTTTTAC GAGCAGAACC TATTGAATTT
ACAATCGATA GAAGTCAGAA AGAAACCCTA CTTGTGAAAG CTGAAAATGC TTTAAAACCA
GGCGATGTTG AATTAACGAA AGTTGATGAT ATCGATGGTA CAGCTCTTGC AGGTGCTGTA
TTTAAAATTG TTGATGCTAA TGATGAAAAG AAAGTCATTC GCGAAAATGT AAAAACTGGT
GCGGATGGTA AAGCTATCGC TACTGGCTTA CGCCCTGGCA ATTATAAATT TATCGAAGTA
TTACGATAA
 
Protein sequence
MIQRPKQLNG RLLTTVIKEI SKKTDALLKD IFDDTHELDV NSIVVKNASY DDKGRLVTGD 
TVNNYTVTNK KNGFDLQFNE DINSAYVITY KTKPTNNVIK DGKIKNTVTA DNGSSKENEA
GFQQQNIIKS NNKAETNYKD KTTTWTITVN NNNYPLNNAI ITDTFDHGGL QLKDKKLEIK
DGDYTLQAET DYVLDVTDKG FKITLIGTYQ SNMTKTLIVK YTTDFDYTKL ESGKTSFKNT
GNLSWIDAGS NPQSNKVEAD FDPDTFTKAN GYKYGSYNAQ TKEITWIIGF NYNNVEIKDP
YVIDVIQDKQ KLVPGSIEVR HMILNGSPDN ARPGDAVPIE QYELEEPTDK NKNTLQVHFK
QSINSPYYII FKTSLDGELI QNTYKNEAEL KDGSKIVNTL KGDAQVNKGG SFVTKKAVQD
DNYINWSIAI NESQSTIADA VVTDDPTDNQ VLVEDSFHLY PTTVDYYGNV TKDTANELKQ
GTDYKLTITT DNNTGKQHFE IAFLKKIDRA YILEYRSLIN ADDKEKVSNK AKIAGNQLTV
KNTETVETIE VRMSSGSGGG SATNGRGNLE IIKVDNDNKK VPLSGAEFTL YDRTGKTVIR
KITTDKDGIA KFNNLKRDKY LLKETKAPEG YVISWDLKQG KIVELGSQET TTYKLANKKF
VGKVVLTKSD DLNKNVTLQG AVFTLLDKDK KIISEHEKLT TNDQGQITVD NLKPGTYYLQ
ETTAPEHYKL DSTPIQFTIK EDQTTVINRT ATNSLIPGSA ILTKVDKDGK TLAGAEFSVR
DRHNNVIRGY EKLTTNDQGQ IEATNLRPGD YQFVEEKAPK DYDIDKTPIE FTIVKSQKKA
VTVTATNHLI KGGVTLTKTD DIDGTALAGA IFKIVDANDE KKVIRENVKT GADGKVTVKD
LEPGTYKFIE TEAPKDYVLN ANPIEFTIDK SQQSFATVTA TNSLKTGEVE LLKVDEFGDK
KPLKGAVFKI VDVNNNDVRT DLTTDADGKT KADKLRPGTY KFIETAAPEH YVLRAEPIEF
TIDRSQKETL LVKAENALKP GDVELTKVDD IDGTALAGAV FKIVDANDEK KVIRENVKTG
ADGKAIATGL RPGNYKFIEV LR