Gene BAS5205 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5205 
Symbol 
ID2848218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp5089114 
End bp5090967 
Gene Length1854 bp 
Protein Length617 aa 
Translation table11 
GC content33% 
IMG OID637508460 
Productcollagen adhesion protein, N-terminus 
Protein accessionYP_031444 
Protein GI49188191 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4932] Predicted outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAAAG GAGGCAAAAT GAAAAAACTT TTCAATATAT GTTTAATTGT ATTTGTACTA 
TTTTCACAGT TTATTAGTTT CCCGTACAAT CAGGCAAAAG CTGAGACTTT AAAGGAAACT
TCATTATTTG ACACTGTTGA GATGAAAGAT GCAACGGATC AGATTATTGA CGAAGCAAAA
AATCCTAACA ACTTAATAAA GATAGGATCA ACCATTCAAG TAGAATATGC TTGGTCAATA
AAGGATCAAC AAGTTGTACA TGCAAACGAT ACAGCAGTAC TTCAAATACC ACTTGCATTA
AAAGTATCTA AAGATTTACA AGGAGATTTA GTAACAGATC AAAAAAATAT TGGTCAATAT
TTTATAACAG CTAAAGATAA TAAATTAAAA CTAATATTTA ATGATCAAGT AGAAAATTCG
AAAGACGCTA AAGGAAAAAT TAAAATTGAT ACTGTGTTTA ACCCAACTTT AAAGACTGAA
GAAAAATCAG TTCAAATCGC TTTTCCTTTA GGAACACTGG TTCAGCCTAT AACAGTTCCT
ATTCAAGTAG AAGATTCTAA AGAAGATGGA ACCAAACAGG ATACTAATAA ACAAGTGCAA
GATCAAGTAG CTAAACCTAC TACTGATAAT CCGGAACAAA ATCCAGCAAC TAAACCTGCT
ACTGACAATC CGGAACAAAA TTCAGCAACT AAACCTGCTA CTGACAATCC GGAACAAAAT
CCAGCAACTA AACCTGCTAC TGACAATCCG GAACAAAATC CAGCAACTAA ACCTGCTACT
GATAATCCGG AACAAAATCC AGCAACTAAA CCTGCTGCTG ATAACCCAGA ACAAAATCTA
GCAAGCGATC CTGCTGAGAT TACAAATTCA GGTCCAAAGC AAATAACAAC AAACATTTTA
ACGGGTGTAA AGTTGACGGA CAAAGACGGA AAACCATTTA CAGAGGATAA CCGTCCAAGT
ACAGATTCCC CTGCCAATAT TGAGTTTACA TGGGAACTTT TAAAATCAAT GAATGTGAAA
AGCGGAGATT ACTATATTTT TGATCTTCCT AAACATTTTA AGATTTACAA TACAATTAAC
AGCCCTTTAT ACGATAGTGA AAACAATCCA ATTGGTAATT TTACTGTTAC AAAAGATGGA
AAAGTCACAA TGACATTCAA CGATTATGTT GAAGAACATC CAGATGTTGT TGGTAACCTA
CAATTAAAGA CAGAATTTAA TAAAGCTGAA ATTAAGGGTA CAACAACACA GGAAATTCCT
TTCCCAATTA AAGATAAAGA TGTTTCTATT ACAGTTGACT TTAAACCTAA TGTACAAACG
GCTACAAATA AAAAAGGGTT ACCTGATAGA CCAATTAATA CAAATGAGAT TAATTGGACA
GTAGAGATGA ACAAAACGAA AGACACCCTT AAAAACGCTG TTTTTAAAGA TAACATCCCA
CAAGGTACAA GTTTAAATAA GGATTCTATT AAAGTTTATT ATTTAGAAGT TGATGTTAAC
GGGAATGCAA CACGTGGTCA AGAAGCTGAT CCAGCAGATT ACAAAATTAT TTCATCAGAT
GGTTCAAAAT TGGAGATTGC TTTTAAAGAT TCTATTAAAA AAGCATATCA AATCGAATAT
GTCACAAAAA TCACTGATGA AAACGTAAAA AGCTTCCAAA ATAACGTTAC GATAACAAGT
GATAATCAAG GGCAACAAAA AGCAAGCTCT ACTGTAACAG TCTCTCGTGG TACACATTTA
AACAAAACAA GTAAATATGA TCCAAAGACC CAAACAATTG AATGGACGAT TACTTACAAC
GGTGATCAAA GAAATATCAA AAAAAACAGA TGCACTTTTA AAAGATATTT TTGA
 
Protein sequence
MFKGGKMKKL FNICLIVFVL FSQFISFPYN QAKAETLKET SLFDTVEMKD ATDQIIDEAK 
NPNNLIKIGS TIQVEYAWSI KDQQVVHAND TAVLQIPLAL KVSKDLQGDL VTDQKNIGQY
FITAKDNKLK LIFNDQVENS KDAKGKIKID TVFNPTLKTE EKSVQIAFPL GTLVQPITVP
IQVEDSKEDG TKQDTNKQVQ DQVAKPTTDN PEQNPATKPA TDNPEQNSAT KPATDNPEQN
PATKPATDNP EQNPATKPAT DNPEQNPATK PAADNPEQNL ASDPAEITNS GPKQITTNIL
TGVKLTDKDG KPFTEDNRPS TDSPANIEFT WELLKSMNVK SGDYYIFDLP KHFKIYNTIN
SPLYDSENNP IGNFTVTKDG KVTMTFNDYV EEHPDVVGNL QLKTEFNKAE IKGTTTQEIP
FPIKDKDVSI TVDFKPNVQT ATNKKGLPDR PINTNEINWT VEMNKTKDTL KNAVFKDNIP
QGTSLNKDSI KVYYLEVDVN GNATRGQEAD PADYKIISSD GSKLEIAFKD SIKKAYQIEY
VTKITDENVK SFQNNVTITS DNQGQQKASS TVTVSRGTHL NKTSKYDPKT QTIEWTITYN
GDQRNIKKNR CTFKRYF