Gene Information |
Locus tag | BAS5207 |
Symbol | |
ID | 2848837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | + |
Start bp | 5094108 |
End bp | 5096756 |
Gene Length | 2649 bp |
Protein Length | 882 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637508462 |
Product | collagen adhesion protein, C-terminal |
Protein accession | YP_031446 |
Protein GI | 49188193 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4932] Predicted outer membrane protein |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
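The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(5094108, 5096756)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 2649 882
```

Under these assumptions the results agree with the Gene Length (2649 bp) and Protein Length (882 aa) rows.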
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GTGCGGATGG TAAAGCTATC GCTACTGGCT TACGCCCTGG CAATTATAAA TTTATCGAAG TATTACGATA AGAATACAAG TCCAATCAAG TTCACAATTA CAGAAAGTCA AACAACGTCA GCTACTGTTA CTGCTAAAAA CAGCTTAACA AAAGGCGGTA TTGAATTAAC AAAAGTGAAT GCTGCAGATG AAAAGGAAAC ACTTGAAGGT GCAGTATTTA AAATCGTTAA TAGAGATACG AATGAAGATG CTCGTACGAA CCTTGTTACG AATAGTGAAG GAAAGTTAGT TGTAGACGAC TTACGTCCTG GTAACTATAA ACTTATCGAA ACAAAGGCTC CAACTTATTA TGATGTAAAT GTAGAACCAA TTGAATTTAC AATCGAAAAA GGACAACAAA CACTTCTTCC ACTTACATTT AAAAATAGTT TAACGAAAGG AAAAGTTAAA CTTATTAAAG AAGATGATGT GGAGAGTAGT ATCGCTCTTG CTGGAGCAGT GTTCACATTA CAAGACGCAA ACGGTACAGA AATTGCGAAA GATTTAAAAA CAGATGAGCA CGGAGTACTA GTTATTCCTG ACTTAGCACC AGGAGATTAC CAATTTATTG AAACAGCCGC TCCTGAACAT TATAAATTAG ATCAAACACC TATTAAGTTC ACAATCGAAA GAAGTCAAAC GAAACATGTC TTCGTTACAG CTACTAACAG CTTAACGAAA GGTAGTGTGG AGTTAATTAA AGTTGATGAT GTTGAAGAAA ACACAACACT CGAGGGTGCT GTATTTAAAA TTGTTAATAA GGATGGACAC GATGTCCGTA CTGATTTAAC TACTGATAAA AACGGCCGTT TAGTTGTTGA TGAATTACCA CCTGGAGACT ATGAGTTTAT CGAAACAAAA GCTCCTACTC ATTATGACTT AAACGAAACA CCTATTAAGT TCACAGTTAA AAAAGGACAA GAAAAAATCG CTTCCGTTAC TGCTACGAAT AGCTTAACGA AAGGCGCTGT GGAACTTTCT AAAGTAGATG ACATTGACGG TTCCACTCTT AAAGATGCAG TATTTAAAAT CGTGGATATG AACGGCAATG ACGTTCGCAC TGATTTAACT ACTAATAAAG ATGGCAAAAT CTCTGTTTCT GATTTACGGC CGGGTGACTA TCAGTTCGTT GAAACGAAAG CTCCTACTCA TTACGACTTA AATCAAACTC CTATCAACTT TACTGTTGAA AAGAGTCAAA CCGCTACAGC TTCTGTTACC GCTAAGAATA GCTTAACGAA AGGCGCTGTG GAATTAACGA AAGTTGATGA CATCGACGGG ACTACTCTTG AAGGAGCTAT CTTTAAAATT GTTGATCAAA ATGGCAATGA TGTTCGCACT GATTTAACTA CTGATAAAGA TGGCAAAATC TCTGTTTCTG ATTTACGGCC GGGTGACTAT CAGTTCGTTG AAACGAAAGC TCCTACTCAT TACGACTTAA ATCAAACTCC TATCAACTTT ACTGTTGAAA AGAGTCAAAC CGCTACAGCT TCTGTTACTG CTACAAATAG CTTAACGAAG GGCGCTGTGG AATTAACGAA AGTTGATGAC ATCGACGGGG CTACTCTTGA AGGAGCAGTA TTTAAAATCG TGGATATGAA CGGCAATGAC GTTCGCACTG ATTTAACTAC TGATAAAGAT GGCAAAATCT CTGTTTCTGA TTTACGTCCG GGTGACTATC AGTTCATCGA AACGAAAGCT CCTAAGCATT ATGACTTAAA TCAAAACCCT ATCAACTTTA CTGTTGAAAA GAGTCAAACC 
GCTACAGCTT CTGTTACTGC TACAAATAGC TTAACGAAAG GTGCTGTAGA ATTAATGAAA GTCGATGATA TTGACGGAAC TACTCTTGAA GGGGCTATCT TTAAAATTGT TGATTCTAAC GGACATGATG TTCGCGCTGA TTTAACTACT GATAAAGATG GCAAAATTTC TGTTTCTGAT TTACGTCCAG GAGACTATCA GTTTATTGAA ACAAAAGCTC CTACTGGATA CGATTTAAAC GCTAAGCCAA TTCCTTTCAC TATTACGAAA GGACAATCTC AAGTTACTTC TGTGACTGCT TTAAATAGTT TAACAACAGG TTCGATGGAG TTAACTAAAG TTGATATTGA TCATAACGGA ACACTTGAAG GTGCCATTTT CAATATTTTA GATCAAGATG GAAAAGTAGT ACGAGAAGGC TTAAAAACAG ATGGGCATGG TAAATTAATC GTAAATGATT TGAAACCTGG TAATTATCAA CTAGTGGAAA CAAAGGCTCC TGAAGGCTAT CAATTAGATG CATCACCAAT AAGCTTTACT ATTGAAAAAG CCCAAGCTTC ACCACTACAA ATTACAGTTT CAAATAAAAA GGTTGAGTCT TCATCAGGCG GGGATAATAA ACCAATTACT CCTCCGAATA AAGAAGAAAA ACCAGGTAAG GAAACTTCCG AAGAACTAGA AAACGGAAAT CCTAAAACAC AGACAAACAA ACAGCAAGAT GACCGAAATA CAGGTAAAGA ACTTCCAAAT ACAGGTCACA AAAATGATTC TACCCAAACA GTCGGTATCA TTCTTCTATT AGCTGGATTG TTAAGTGTTT TAGCTACAAA ACGAAAAAAA TATTATTAA
Protein sequence | MRMVKLSLLA YALAIINLSK YYDKNTSPIK FTITESQTTS ATVTAKNSLT KGGIELTKVN AADEKETLEG AVFKIVNRDT NEDARTNLVT NSEGKLVVDD LRPGNYKLIE TKAPTYYDVN VEPIEFTIEK GQQTLLPLTF KNSLTKGKVK LIKEDDVESS IALAGAVFTL QDANGTEIAK DLKTDEHGVL VIPDLAPGDY QFIETAAPEH YKLDQTPIKF TIERSQTKHV FVTATNSLTK GSVELIKVDD VEENTTLEGA VFKIVNKDGH DVRTDLTTDK NGRLVVDELP PGDYEFIETK APTHYDLNET PIKFTVKKGQ EKIASVTATN SLTKGAVELS KVDDIDGSTL KDAVFKIVDM NGNDVRTDLT TNKDGKISVS DLRPGDYQFV ETKAPTHYDL NQTPINFTVE KSQTATASVT AKNSLTKGAV ELTKVDDIDG TTLEGAIFKI VDQNGNDVRT DLTTDKDGKI SVSDLRPGDY QFVETKAPTH YDLNQTPINF TVEKSQTATA SVTATNSLTK GAVELTKVDD IDGATLEGAV FKIVDMNGND VRTDLTTDKD GKISVSDLRP GDYQFIETKA PKHYDLNQNP INFTVEKSQT ATASVTATNS LTKGAVELMK VDDIDGTTLE GAIFKIVDSN GHDVRADLTT DKDGKISVSD LRPGDYQFIE TKAPTGYDLN AKPIPFTITK GQSQVTSVTA LNSLTTGSME LTKVDIDHNG TLEGAIFNIL DQDGKVVREG LKTDGHGKLI VNDLKPGNYQ LVETKAPEGY QLDASPISFT IEKAQASPLQ ITVSNKKVES SSGGDNKPIT PPNKEEKPGK ETSEELENGN PKTQTNKQQD DRNTGKELPN TGHKNDSTQT VGIILLLAGL LSVLATKRKK YY
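Both sequences are displayed in space-separated blocks of ten characters. The sketch below shows one way to strip that display formatting and compute a GC percentage comparable to the GC content row. The helper names are hypothetical, and the toy input is only the first two blocks of the gene sequence, so its result is not the gene-wide value.

```python
def clean_sequence(raw: str) -> str:
    """Remove whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two display blocks of the gene sequence, used as a toy input.
blocks = "GTGCGGATGG TAAAGCTATC"
seq = clean_sequence(blocks)
print(len(seq), round(gc_percent(seq), 1))  # prints: 20 50.0
```

Passing the full gene sequence through `clean_sequence` first avoids counting the grouping spaces toward the length.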