Gene BAS5207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5207 
Symbol 
ID2848837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5094108 
End bp5096756 
Gene Length2649 bp 
Protein Length882 aa 
Translation table11 
GC content37% 
IMG OID637508462 
Productcollagen adhesion protein, C-terminal 
Protein accessionYP_031446 
Protein GI49188193 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID[TIGR01167] LPXTG-motif cell wall anchor domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCGGATGG TAAAGCTATC GCTACTGGCT TACGCCCTGG CAATTATAAA TTTATCGAAG 
TATTACGATA AGAATACAAG TCCAATCAAG TTCACAATTA CAGAAAGTCA AACAACGTCA
GCTACTGTTA CTGCTAAAAA CAGCTTAACA AAAGGCGGTA TTGAATTAAC AAAAGTGAAT
GCTGCAGATG AAAAGGAAAC ACTTGAAGGT GCAGTATTTA AAATCGTTAA TAGAGATACG
AATGAAGATG CTCGTACGAA CCTTGTTACG AATAGTGAAG GAAAGTTAGT TGTAGACGAC
TTACGTCCTG GTAACTATAA ACTTATCGAA ACAAAGGCTC CAACTTATTA TGATGTAAAT
GTAGAACCAA TTGAATTTAC AATCGAAAAA GGACAACAAA CACTTCTTCC ACTTACATTT
AAAAATAGTT TAACGAAAGG AAAAGTTAAA CTTATTAAAG AAGATGATGT GGAGAGTAGT
ATCGCTCTTG CTGGAGCAGT GTTCACATTA CAAGACGCAA ACGGTACAGA AATTGCGAAA
GATTTAAAAA CAGATGAGCA CGGAGTACTA GTTATTCCTG ACTTAGCACC AGGAGATTAC
CAATTTATTG AAACAGCCGC TCCTGAACAT TATAAATTAG ATCAAACACC TATTAAGTTC
ACAATCGAAA GAAGTCAAAC GAAACATGTC TTCGTTACAG CTACTAACAG CTTAACGAAA
GGTAGTGTGG AGTTAATTAA AGTTGATGAT GTTGAAGAAA ACACAACACT CGAGGGTGCT
GTATTTAAAA TTGTTAATAA GGATGGACAC GATGTCCGTA CTGATTTAAC TACTGATAAA
AACGGCCGTT TAGTTGTTGA TGAATTACCA CCTGGAGACT ATGAGTTTAT CGAAACAAAA
GCTCCTACTC ATTATGACTT AAACGAAACA CCTATTAAGT TCACAGTTAA AAAAGGACAA
GAAAAAATCG CTTCCGTTAC TGCTACGAAT AGCTTAACGA AAGGCGCTGT GGAACTTTCT
AAAGTAGATG ACATTGACGG TTCCACTCTT AAAGATGCAG TATTTAAAAT CGTGGATATG
AACGGCAATG ACGTTCGCAC TGATTTAACT ACTAATAAAG ATGGCAAAAT CTCTGTTTCT
GATTTACGGC CGGGTGACTA TCAGTTCGTT GAAACGAAAG CTCCTACTCA TTACGACTTA
AATCAAACTC CTATCAACTT TACTGTTGAA AAGAGTCAAA CCGCTACAGC TTCTGTTACC
GCTAAGAATA GCTTAACGAA AGGCGCTGTG GAATTAACGA AAGTTGATGA CATCGACGGG
ACTACTCTTG AAGGAGCTAT CTTTAAAATT GTTGATCAAA ATGGCAATGA TGTTCGCACT
GATTTAACTA CTGATAAAGA TGGCAAAATC TCTGTTTCTG ATTTACGGCC GGGTGACTAT
CAGTTCGTTG AAACGAAAGC TCCTACTCAT TACGACTTAA ATCAAACTCC TATCAACTTT
ACTGTTGAAA AGAGTCAAAC CGCTACAGCT TCTGTTACTG CTACAAATAG CTTAACGAAG
GGCGCTGTGG AATTAACGAA AGTTGATGAC ATCGACGGGG CTACTCTTGA AGGAGCAGTA
TTTAAAATCG TGGATATGAA CGGCAATGAC GTTCGCACTG ATTTAACTAC TGATAAAGAT
GGCAAAATCT CTGTTTCTGA TTTACGTCCG GGTGACTATC AGTTCATCGA AACGAAAGCT
CCTAAGCATT ATGACTTAAA TCAAAACCCT ATCAACTTTA CTGTTGAAAA GAGTCAAACC
GCTACAGCTT CTGTTACTGC TACAAATAGC TTAACGAAAG GTGCTGTAGA ATTAATGAAA
GTCGATGATA TTGACGGAAC TACTCTTGAA GGGGCTATCT TTAAAATTGT TGATTCTAAC
GGACATGATG TTCGCGCTGA TTTAACTACT GATAAAGATG GCAAAATTTC TGTTTCTGAT
TTACGTCCAG GAGACTATCA GTTTATTGAA ACAAAAGCTC CTACTGGATA CGATTTAAAC
GCTAAGCCAA TTCCTTTCAC TATTACGAAA GGACAATCTC AAGTTACTTC TGTGACTGCT
TTAAATAGTT TAACAACAGG TTCGATGGAG TTAACTAAAG TTGATATTGA TCATAACGGA
ACACTTGAAG GTGCCATTTT CAATATTTTA GATCAAGATG GAAAAGTAGT ACGAGAAGGC
TTAAAAACAG ATGGGCATGG TAAATTAATC GTAAATGATT TGAAACCTGG TAATTATCAA
CTAGTGGAAA CAAAGGCTCC TGAAGGCTAT CAATTAGATG CATCACCAAT AAGCTTTACT
ATTGAAAAAG CCCAAGCTTC ACCACTACAA ATTACAGTTT CAAATAAAAA GGTTGAGTCT
TCATCAGGCG GGGATAATAA ACCAATTACT CCTCCGAATA AAGAAGAAAA ACCAGGTAAG
GAAACTTCCG AAGAACTAGA AAACGGAAAT CCTAAAACAC AGACAAACAA ACAGCAAGAT
GACCGAAATA CAGGTAAAGA ACTTCCAAAT ACAGGTCACA AAAATGATTC TACCCAAACA
GTCGGTATCA TTCTTCTATT AGCTGGATTG TTAAGTGTTT TAGCTACAAA ACGAAAAAAA
TATTATTAA
 
Protein sequence
MRMVKLSLLA YALAIINLSK YYDKNTSPIK FTITESQTTS ATVTAKNSLT KGGIELTKVN 
AADEKETLEG AVFKIVNRDT NEDARTNLVT NSEGKLVVDD LRPGNYKLIE TKAPTYYDVN
VEPIEFTIEK GQQTLLPLTF KNSLTKGKVK LIKEDDVESS IALAGAVFTL QDANGTEIAK
DLKTDEHGVL VIPDLAPGDY QFIETAAPEH YKLDQTPIKF TIERSQTKHV FVTATNSLTK
GSVELIKVDD VEENTTLEGA VFKIVNKDGH DVRTDLTTDK NGRLVVDELP PGDYEFIETK
APTHYDLNET PIKFTVKKGQ EKIASVTATN SLTKGAVELS KVDDIDGSTL KDAVFKIVDM
NGNDVRTDLT TNKDGKISVS DLRPGDYQFV ETKAPTHYDL NQTPINFTVE KSQTATASVT
AKNSLTKGAV ELTKVDDIDG TTLEGAIFKI VDQNGNDVRT DLTTDKDGKI SVSDLRPGDY
QFVETKAPTH YDLNQTPINF TVEKSQTATA SVTATNSLTK GAVELTKVDD IDGATLEGAV
FKIVDMNGND VRTDLTTDKD GKISVSDLRP GDYQFIETKA PKHYDLNQNP INFTVEKSQT
ATASVTATNS LTKGAVELMK VDDIDGTTLE GAIFKIVDSN GHDVRADLTT DKDGKISVSD
LRPGDYQFIE TKAPTGYDLN AKPIPFTITK GQSQVTSVTA LNSLTTGSME LTKVDIDHNG
TLEGAIFNIL DQDGKVVREG LKTDGHGKLI VNDLKPGNYQ LVETKAPEGY QLDASPISFT
IEKAQASPLQ ITVSNKKVES SSGGDNKPIT PPNKEEKPGK ETSEELENGN PKTQTNKQQD
DRNTGKELPN TGHKNDSTQT VGIILLLAGL LSVLATKRKK YY