Gene B21_01383 details

Gene Information

Locus tag: B21_01383
Symbol: gapC
ID: 8115365
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 1442406
End bp: 1443407
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 48%
IMG OID: 644847626
Product: hypothetical protein
Protein accession: YP_002999199
Protein GI: 251784895
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0057] Glyceraldehyde-3-phosphate dehydrogenase/erythrose-4-phosphate dehydrogenase
TIGRFAM ID: [TIGR01534] glyceraldehyde-3-phosphate dehydrogenase, type I
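
The listed coordinates and lengths can be cross-checked with simple arithmetic. The Python sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue to the protein. The variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1442406
end_bp = 1443407

# Length of an inclusive interval.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.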


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAG TTGGTATTAA CGGTTTTGGT CGTATCGGTC GACTGGTGTT GCGTCGATTA 
CTTGAAGTCA AAAGCAACAT AGACGTTGTC GCTATTAATG ATCTCACTTC CCCAAAAATT
CTCGCCTACC TGCTGAAACA TGATTCAAAC TACGGACCAT TCCCCTGGAG CGTTGATTTT
ACGGAAGATT CACTTATCGT TGATGGGAAA AGTATCGCGG TTTACGCCGA AAAAGAGGCT
AAAAATATTC CGTGGAAAGC GAAAGGTGCA GAAATCATTG TCGAATGTAC TGGCTTTTAT
ACCTCCGCCG AGAAATCGCA GGCGCATCTT GATGCTGGTG CGAAGAAGGT GTTGATTTCC
GCCCCTGCCG GTGAAATGAA AACTATCGTT TATAACGTCA ATGACGACAC TCTGGATGGC
AACGACACCA TTGTTTCCGT GGCGTCATGC ACCACTAACT GTCTTGCGCC GATGGCCAAA
GCCTTGCATG ACAGTTTCGG GATAGAAGTC GGCACGATGA CGACCATTCA TGCCTATACT
GGCACCCAGT CACTGGTGGA TGGCCCACGT GGTAAAGATT TACGTGCTTC ACGCGCAGCG
GCAGAAAATA TCATTCCCCA CACTACGGGG GCGGCAAAAG CCATTGGTCT GGTGATCCCG
GAACTGAGCG GCAAACTGAA AGGTCATGCG CAACGCGTGC CGGTGAAAAC AGGTTCGGTC
ACTGAGCTGG TGTCCATTCT CGGAAAAAAA GTGACTGCCG AAGAGGTGAA TAACGCACTT
AAACAAGCAA CCACCAATAA CGAGTCATTT GGTTATACCG ATGAAGAAAT AGTCTCTTCC
GATATCATTG GCAGCCATTT CGGTTCGGTG TTTGATGCCA CGCAAACGGA AATTACCGCT
GTGGGCGATT TACAACTGGT GAAAACGGTC GCCTGGTACG ATAACGAATA TGGCTTCGTC
ACGCAGCTTA TTCGCACCCT CGAAAAATTC GCTAAACTCT GA
 
Protein sequence
MSKVGINGFG RIGRLVLRRL LEVKSNIDVV AINDLTSPKI LAYLLKHDSN YGPFPWSVDF 
TEDSLIVDGK SIAVYAEKEA KNIPWKAKGA EIIVECTGFY TSAEKSQAHL DAGAKKVLIS
APAGEMKTIV YNVNDDTLDG NDTIVSVASC TTNCLAPMAK ALHDSFGIEV GTMTTIHAYT
GTQSLVDGPR GKDLRASRAA AENIIPHTTG AAKAIGLVIP ELSGKLKGHA QRVPVKTGSV
TELVSILGKK VTAEEVNNAL KQATTNNESF GYTDEEIVSS DIIGSHFGSV FDATQTEITA
VGDLQLVKTV AWYDNEYGFV TQLIRTLEKF AKL
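
The gene and protein sequences are printed in whitespace-separated blocks of ten characters. The following Python sketch shows one way to strip that layout, translate the DNA, and compute a GC percentage. It assumes the sequence contains only A, C, G and T, and it uses the standard amino-acid assignments, which translation table 11 shares for elongation codons (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The function names are hypothetical helpers, not part of any database API.

```python
# Standard codon assignments in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of bases that are G or C."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAGTAAAG TTGGTATTAA CGGTTTTGGT"
print(translate(clean_sequence(first_blocks)))  # MSKVGINGFG
```

Applied to the first three blocks of the gene sequence, the translation reproduces the first ten residues of the listed protein sequence. Passing the full cleaned gene sequence to `gc_percent` would give a value to compare against the GC content field.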