Gene GBAA_5443 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_5443 
SymbolcelB-2 
ID2819175 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp4932110 
End bp4933417 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content38% 
IMG OID637792111 
ProductPTS system cellobiose-specific transporter subunit IIC 
Protein accessionYP_022106 
Protein GI47530757 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00359] phosphotransferase system, cellobiose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAT TTATTGCATT TATGGAGAAA TATATTGTTC CTGTCGCTGG TAAAATCGGG 
TCGCAACGTC ATTTAGCTGC GATCCGTGAC GGATTTATCG CAGTTATGCC ACTTATTTTA
GTTGGTGCAC TGGCATCACT AATTAATGGT TTTCCATCTG AGGCTTTCCA AGATTTCATG
AAAGGTTTGT TTGGTGAAAC GTGGAAACAA GTTGGCGGTG GAATGTGGAC TGGTTCTTTC
GCGATTCTAG CACTAATCAT AGCATTTACA ACAAGTTATA ACTTAGCAAA ATCTTACGGC
GTTGATGGTT TGTCAGCAGG TATTATTTCA TTTGGTGCGT TAATTATTCT TACGCCAACA
ACACCGAAAG AAGGCGGATT GAACTTAGCT TGGACAGGTG CACAAGGGTT ATTCGTAGCA
ATTATTGTAG CACTCCTTGT TACTGAAGTA TTCCGTTTCT TCGTACAAAG AAACATTACT
TTTAAAATGC CTGATGGAGT ACCACCAGCA GTTTTAAGAT CTTTCGCAGC TATAGTTCCA
GCATTTGTTA TTTTAACAGT AGTTGCAGGT ATTCAATTAG CAGTGAAATT AGCCGGTACA
AGTGTTCATG AATTTATCTT TAATACGATT CAATCGCCAC TGCAAAGTTT AGCAGGGACA
TTACCAAGTG CAATTGTTAT TGTACTCCTT GTTCATCTTC TTTGGTTCTT CGGTTTACAT
GGTCCAAATA TCGTTGGTGG TATTATTGAG CCGTTATACT TACCAGCATT AGAGAAAAAT
ATGAAGTTAT TCCAAGGTGG CACTTCTGCA TTTGATGTTC CAAACATTGT TACAAAACCA
TTCTTTGATA CTTTCGTATA TCTTGGTGGT TCTGGTGCAA CATTAGCGTT CTTAGTAGTG
GTATTACTTG TAGCAAAAAG TGCACAATTA CGCGGTGTAT CTCGCTTATC AATTGGTCCA
GGTGCGTTCA ACATTAACGA ACCAGTAATC TTTGGTACAC CAATTATTTT AAATCCAGTT
TTATTCTTGC CGTTTATCAT AACACCAATT GTATTGGTAA TTACTTCTTA TACAGCTATA
TCTATTGGCT GGGTACCAAA AACAGTTGCA ATGATTCCAT GGGCAACACC ACCAATTATT
AGTGGTTATC TTGTAACAGG TGGACATCTT TCAGGTGCAA TTCTACAGTT ATTCAACTTT
GTAATTGCAA TGGTAATCTA TTATCCATTC GTTGTGTTAT GTGACCGTTC AGTTGTTCGT
ACTGAAAAAG CAGCAGCACA AGGAAATAAC AACTCTGTAC CTATGTAA
 
Protein sequence
MQKFIAFMEK YIVPVAGKIG SQRHLAAIRD GFIAVMPLIL VGALASLING FPSEAFQDFM 
KGLFGETWKQ VGGGMWTGSF AILALIIAFT TSYNLAKSYG VDGLSAGIIS FGALIILTPT
TPKEGGLNLA WTGAQGLFVA IIVALLVTEV FRFFVQRNIT FKMPDGVPPA VLRSFAAIVP
AFVILTVVAG IQLAVKLAGT SVHEFIFNTI QSPLQSLAGT LPSAIVIVLL VHLLWFFGLH
GPNIVGGIIE PLYLPALEKN MKLFQGGTSA FDVPNIVTKP FFDTFVYLGG SGATLAFLVV
VLLVAKSAQL RGVSRLSIGP GAFNINEPVI FGTPIILNPV LFLPFIITPI VLVITSYTAI
SIGWVPKTVA MIPWATPPII SGYLVTGGHL SGAILQLFNF VIAMVIYYPF VVLCDRSVVR
TEKAAAQGNN NSVPM