Gene BAS5063 details


Gene Information

Locus tag: BAS5063
Symbol:
ID: 2850729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4937250
End bp: 4938551
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 39%
IMG OID: 637508318
Product: PTS system cellobiose-specific transporter subunit IIC
Protein accession: YP_031302
Protein GI: 49188049
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1455] Phosphotransferase system cellobiose-specific component IIC
TIGRFAM ID: [TIGR00359] phosphotransferase system, cellobiose specific, IIC component
            [TIGR00410] PTS system, lactose/cellobiose family IIC component
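
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4937250
end_bp = 4938551

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1302
print(protein_length)  # 433
```

Both results agree with the Gene Length (1302 bp) and Protein Length (433 aa) fields.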


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATACGGT TTTTAGAGAA ATATGTAATG CCGGTAGCAG GGAAGGTTGC AGAGCAGAGG 
CACTTGCAAG CAATTCGAGA TGGAATTATT TTAACGATGC CTTTTTTAAT CATTGGATCA
TTTTTCCTTA TTATTAGTGC ATTACCAATA CCAGGCTATA ATGATTTTAT GGCAGGTTTG
TTTGGGGAGA ATTGGCAGAG GGCTTTGGGG TATCCAGTTA GTGCGACTTT TAATATAATG
GCTTTAATAG CTGTTTTTGG AATCGCTTAC AGGCTTGGGG AATATTATAA AGTGGATGCT
TTAGCATCCG GAGCATTGTC CCTTGTGACG TTTTTACTTG TGACGCCATT TCAAGTTGCA
TATATTATAC CAAGTACAAA AGAGAGTGTA CTTGTAGAAG GTGCTATCCC AGCTGCATTA
ATGGGAAGCC AAGGGTTGTT TGTAGCAATG ATTATTGCAC TTATATCTAC TGAACTTTAT
CGGTTTATTG TACAAAAAAA GATAATTATA AAGATGCCAG AAACAGTTCC ACCAGCCGTG
ACGCGCTCAT TTGCGGCACT TGTTCCAGGA TTTATTGTTG TAACGGTTAT TTGGATTGTA
CGCTTAATTA TAGAAAATAC TTCTTTTGGC AGTATCCATA ATATTGTAGG GCAAATTTTG
CAGGAACCAC TTAGTGTACT TGGTGCTAGT CTTTGGGGCG CAATAATAGC AGTTATTCTC
GTTCATGTCC TTTGGTCTTG TGGAATTCAT GGTGCTACTA TTGTTGGTGG TGTAATGAGT
CCTGTTTGGT TGTCGTTAAT GGATCAAAAC CGAGTTGCTT TCCAAGCGGG GCAAGATGTA
CCAAATACGA TTACCGCACA GTTTTTTGAT TTATGGATTT ATATGGGCGG TTCCGGCGCA
ACACTAGCTT TAGTTGTCGG AATGTTATTG TTTGCACGAA GTCAGCAATT AAAAAGTTTA
GGGAGATTGT CAATTGCACC TGGTATATTT AATATTAATG AGATGGTAAC TTTTGGTATG
CCGATTGTAA TGAACCCAAT TTTATTAATT CCATTTATAT TAGTTCCGGT TGTGTTAACG
ATCGTTTCTT ACTTTGCAAT GGAATGGGGA TTAGTTGCTC GCCCGAGTGG AGCTGCTGTA
CCTTGGACGA CACCTATTCT TTTTAGTGGA TATTTAGGAT CGGGCGGGAA AATTTCAGGC
GTTGTTTTAC AACTTGTTAA CTTTGCGCTT GCATTCTTCA TTTATTTACC GTTCTTAAAA
ATATGGGATA AACAAAAAGT AGCGGAAGAA AAGGGGGAGT AA
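
A GC content figure such as the 39% reported for this gene is the fraction of G and C bases in the sequence. The helper below is a minimal sketch of that calculation; `gc_content` is a hypothetical name, and the example runs only on the first 10-base block of the sequence above, so its result is not the whole-gene value.

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(dna.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# First 10-base block of the gene sequence: 3 G + 1 C out of 10 bases.
first_block_gc = gc_content("ATGATACGGT")
print(first_block_gc)  # 40.0
```

Passing the full gene sequence (with or without the spaces between blocks) would give the whole-gene percentage.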
 
Protein sequence
MIRFLEKYVM PVAGKVAEQR HLQAIRDGII LTMPFLIIGS FFLIISALPI PGYNDFMAGL 
FGENWQRALG YPVSATFNIM ALIAVFGIAY RLGEYYKVDA LASGALSLVT FLLVTPFQVA
YIIPSTKESV LVEGAIPAAL MGSQGLFVAM IIALISTELY RFIVQKKIII KMPETVPPAV
TRSFAALVPG FIVVTVIWIV RLIIENTSFG SIHNIVGQIL QEPLSVLGAS LWGAIIAVIL
VHVLWSCGIH GATIVGGVMS PVWLSLMDQN RVAFQAGQDV PNTITAQFFD LWIYMGGSGA
TLALVVGMLL FARSQQLKSL GRLSIAPGIF NINEMVTFGM PIVMNPILLI PFILVPVVLT
IVSYFAMEWG LVARPSGAAV PWTTPILFSG YLGSGGKISG VVLQLVNFAL AFFIYLPFLK
IWDKQKVAEE KGE
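
The protein sequence can be related back to the gene sequence by translating codons with translation table 11. The sketch below builds the codon table from the NCBI base ordering (T, C, A, G); table 11 uses the same amino acid assignments as the standard code and differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG. The `translate` function is a hypothetical helper, and only the first three 10-base blocks of the gene are translated.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, over "TCAG".
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)

# First 30 bases of the gene sequence (10 codons).
prefix = translate("ATGATACGGT TTTTAGAGAA ATATGTAATG")
print(prefix)  # MIRFLEKYVM
```

The output matches the first ten residues of the protein sequence listed above.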