Gene GBAA_5448 details

Gene Information

Locus tag: GBAA_5448
Symbol: celB-3
ID: 2819161
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 4936039
End bp: 4937340
Gene Length: 1302 bp
Protein Length: 433 aa
Translation table: 11
GC content: 39%
IMG OID: 637792116
Product: PTS system cellobiose-specific transporter subunit IIC
Protein accession: YP_022111
Protein GI: 47530762
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1455] Phosphotransferase system cellobiose-specific component IIC
TIGRFAM ID: [TIGR00359] phosphotransferase system, cellobiose specific, IIC component
            [TIGR00410] PTS system, lactose/cellobiose family IIC component
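
The length fields in this record follow from the coordinates. The sketch below checks that arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp):
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(4936039, 4937340)
print(length, protein_length_aa(length))  # 1302 433
```

Both values agree with the Gene Length and Protein Length fields listed above.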


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATACGGT TTTTAGAGAA ATATGTAATG CCGGTAGCAG GGAAGGTTGC AGAGCAGAGG 
CACTTGCAAG CAATTCGAGA TGGAATTATT TTAACGATGC CTTTTTTAAT CATTGGATCA
TTTTTCCTTA TTATTAGTGC ATTACCAATA CCAGGCTATA ATGATTTTAT GGCAGGTTTG
TTTGGGGAGA ATTGGCAGAG GGCTTTGGGG TATCCAGTTA GTGCGACTTT TAATATAATG
GCTTTAATAG CTGTTTTTGG AATCGCTTAC AGGCTTGGGG AATATTATAA AGTGGATGCT
TTAGCATCCG GAGCATTGTC CCTTGTGACG TTTTTACTTG TGACGCCATT TCAAGTTGCA
TATATTATAC CAAGTACAAA AGAGAGTGTA CTTGTAGAAG GTGCTATCCC AGCTGCATTA
ATGGGAAGCC AAGGGTTGTT TGTAGCAATG ATTATTGCAC TTATATCTAC TGAACTTTAT
CGGTTTATTG TACAAAAAAA GATAATTATA AAGATGCCAG AAACAGTTCC ACCAGCCGTG
ACGCGCTCAT TTGCGGCACT TGTTCCAGGA TTTATTGTTG TAACGGTTAT TTGGATTGTA
CGCTTAATTA TAGAAAATAC TTCTTTTGGC AGTATCCATA ATATTGTAGG GCAAATTTTG
CAGGAACCAC TTAGTGTACT TGGTGCTAGT CTTTGGGGCG CAATAATAGC AGTTATTCTC
GTTCATGTCC TTTGGTCTTG TGGAATTCAT GGTGCTACTA TTGTTGGTGG TGTAATGAGT
CCTGTTTGGT TGTCGTTAAT GGATCAAAAC CGAGTTGCTT TCCAAGCGGG GCAAGATGTA
CCAAATACGA TTACCGCACA GTTTTTTGAT TTATGGATTT ATATGGGCGG TTCCGGCGCA
ACACTAGCTT TAGTTGTCGG AATGTTATTG TTTGCACGAA GTCAGCAATT AAAAAGTTTA
GGGAGATTGT CAATTGCACC TGGTATATTT AATATTAATG AGATGGTAAC TTTTGGTATG
CCGATTGTAA TGAACCCAAT TTTATTAATT CCATTTATAT TAGTTCCGGT TGTGTTAACG
ATCGTTTCTT ACTTTGCAAT GGAATGGGGA TTAGTTGCTC GCCCGAGTGG AGCTGCTGTA
CCTTGGACGA CACCTATTCT TTTTAGTGGA TATTTAGGAT CGGGCGGGAA AATTTCAGGC
GTTGTTTTAC AACTTGTTAA CTTTGCGCTT GCATTCTTCA TTTATTTACC GTTCTTAAAA
ATATGGGATA AACAAAAAGT AGCGGAAGAA AAGGGGGAGT AA
 
Protein sequence
MIRFLEKYVM PVAGKVAEQR HLQAIRDGII LTMPFLIIGS FFLIISALPI PGYNDFMAGL 
FGENWQRALG YPVSATFNIM ALIAVFGIAY RLGEYYKVDA LASGALSLVT FLLVTPFQVA
YIIPSTKESV LVEGAIPAAL MGSQGLFVAM IIALISTELY RFIVQKKIII KMPETVPPAV
TRSFAALVPG FIVVTVIWIV RLIIENTSFG SIHNIVGQIL QEPLSVLGAS LWGAIIAVIL
VHVLWSCGIH GATIVGGVMS PVWLSLMDQN RVAFQAGQDV PNTITAQFFD LWIYMGGSGA
TLALVVGMLL FARSQQLKSL GRLSIAPGIF NINEMVTFGM PIVMNPILLI PFILVPVVLT
IVSYFAMEWG LVARPSGAAV PWTTPILFSG YLGSGGKISG VVLQLVNFAL AFFIYLPFLK
IWDKQKVAEE KGE
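
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch that strips the 10-base grouping, computes GC content, and translates the CDS. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in which codons may act as start codons. This gene begins with ATG, so no special handling of alternative starts is included. All names here are illustrative.

```python
import itertools

BASES = "TCAG"
# Amino acids in NCBI codon order (first base slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace (the 10-base grouping and line breaks) and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGATACGGT TTTTAGAGAA ATATGTAATG"))  # MIRFLEKYVM
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing its spaces) is a quick consistency check on the record.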