Gene TBFG_10070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10070 
SymbolglyA 
ID5220733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp77761 
End bp79038 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content66% 
IMG OID640604810 
Productserine hydroxymethyltransferase 
Protein accessionYP_001286015 
Protein GI148821261 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones321 
Plasmid unclonability p-value0.9411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones188 
Fosmid unclonability p-value0.116151 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACACCC TCAACGACTC CCTGACCGCC TTCGACCCGG ACATCGCCGC CCTGATCGAC 
GGCGAGCTGC GCCGTCAAGA ATCCGGCTTG GAGATGATCG CTTCGGAGAA CTATGCACCG
CTGGCCGTGA TGCAGGCCCA AGGTTCGGTC TTGACCAACA AGTACGCCGA AGGCTACCCG
GGCCGGCGCT ACTACGGTGG CTGTGAATTC GTCGACGGTG TCGAGCAGTT GGCTATCGAC
CGCGTCAAAG CGCTCTTTGG CGCCGAATAC GCCAACGTGC AACCACATTC GGGGGCCACC
GCCAACGCCG CCACCATGCA TGCGCTGCTA AACCCCGGCG ACACCATCCT GGGGTTGTCG
CTGGCTCATG GCGGTCACCT GACCCACGGG ATGCGGATCA ACTTTTCCGG CAAGCTCTAC
CACGCCACCG CCTACGAGGT GTCCAAAGAG GACTACCTGG TCGACATGGA TGCCGTCGCC
GAGGCAGCGC GCACACACCG GCCCAAAATG ATCATCGCCG GCTGGTCGGC GTACCCACGC
CAGCTGGATT TCGCCCGCTT CCGCGCCATC GCCGACGAAG TCGACGCCGT GCTCATGGTG
GATATGGCGC ATTTCGCCGG CCTGGTCGCC GCTGGCGTGC ACCCCAGCCC GGTGCCGCAC
GCCCACGTCG TCACCTCCAC CACTCACAAG ACGCTCGGCG GGCCCCGCGG CGGCATCATC
TTGTGCAATG ACCCGGCCAT CGCCAAGAAG ATCAATTCCG CGGTCTTCCC TGGGCAGCAG
GGCGGGCCGC TCGAGCATGT CATCGCAGCC AAGGCCACCG CATTCAAGAT GGCAGCACAA
CCTGAATTCG CGCAGCGCCA ACAACGTTGC CTCGACGGCG CGCGCATCCT TGCCGGCCGG
TTGACCCAGC CCGACGTCGC CGAACGTGGC ATCGCGGTGC TAACCGGCGG CACCGATGTG
CACCTCGTCC TAGTCGACCT GCGCGACGCC GAACTCGACG GCCAGCAAGC CGAAGACCGG
TTGGCCGCCG TGGACATCAC CGTCAACCGC AACGCGGTAC CCTTCGACCC TCGTCCCCCG
ATGATCACCT CGGGCCTGCG AATCGGCACC CCGGCGCTGG CCGCACGCGG CTTCTCCCAC
AACGACTTCC GCGCCGTGGC AGACCTCATC GCGGCGGCAC TGACGGCCAC CAACGACGAC
CAGCTGGGTC CGCTGCGCGC CCAGGTCCAG CGGCTGGCCG CACGCTATCC GCTCTACCCG
GAACTGCACC GGACATGA
 
Protein sequence
MNTLNDSLTA FDPDIAALID GELRRQESGL EMIASENYAP LAVMQAQGSV LTNKYAEGYP 
GRRYYGGCEF VDGVEQLAID RVKALFGAEY ANVQPHSGAT ANAATMHALL NPGDTILGLS
LAHGGHLTHG MRINFSGKLY HATAYEVSKE DYLVDMDAVA EAARTHRPKM IIAGWSAYPR
QLDFARFRAI ADEVDAVLMV DMAHFAGLVA AGVHPSPVPH AHVVTSTTHK TLGGPRGGII
LCNDPAIAKK INSAVFPGQQ GGPLEHVIAA KATAFKMAAQ PEFAQRQQRC LDGARILAGR
LTQPDVAERG IAVLTGGTDV HLVLVDLRDA ELDGQQAEDR LAAVDITVNR NAVPFDPRPP
MITSGLRIGT PALAARGFSH NDFRAVADLI AAALTATNDD QLGPLRAQVQ RLAARYPLYP
ELHRT