Gene Gdia_3029 details

Gene Information

Locus tag: Gdia_3029
Symbol:
ID: 6976463
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3314376
End bp: 3315572
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 70%
IMG OID: 643392537
Product: threonine dehydratase
Protein accession: YP_002277374
Protein GI: 209545145
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR01127] threonine dehydratase, medium form
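
The coordinate and length fields in this record are related by simple arithmetic, so they can be cross-checked. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record.
length = gene_length(3314376, 3315572)
print(length)                  # 1197
print(protein_length(length))  # 398
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with an unspliced CDS.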


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.0116526
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACCC TCAACGACGT TCGCGCCGCC GCCGCGCGGA TCGAGGGGCG TGTCCTGCGC 
ACGCCCACCG TTCCCTCGCA CGCGCTTTCC AGGGCGACGG GGGCGGACAT CGTCCTGAAG
CTGGACAATC TCCAGGCCGT GGGCTCGTTC AAGGAACGCG GGGCCGCCAA CAAGCTGGCG
CTGCTGACGC CCGAGGAACG CGCGCGCGGA GTGATCACCG TATCGGCCGG CAACCACGCC
CAGGGCGTGG CCCGCCATGC CGCGCTGCTG GAAATCGACG CGGTCATCGT CATGCCGCGC
TTCACCCCGG CCGCCAAGGT CACCCGCACC GCCGCCTGGG GCGCGCGCGT GGTGCTGGAA
GGCGACAATT TCGCCGAGGC CACGGCCCAT GCCAACGCCC TGTGCGCGCG GGAAGGCCGG
GTCTTCGTCC ATCCTTACGA CGACCCCGAG GTCATGGCCG GCCAGGGCAC CTTCGCCCTG
GAACTGTTCG AGGATGCGGG CCCGCTCGAC ATCTTCGTCG GTCCGATCGG CGGCGGCGGC
CTGCTGTCGG GCTGCGCCGT GGTGGGCCGC GCGCTGCGTC CCGGCATGGA CATCATGGGC
GTGCAGGTCG AAAGCTATTC CTCGCTGTCC GCCTTCCCGG GGGACGAGAT CATGCCCCCC
GGCGGCGCCA CGATCGCCGA GGGCATCGCG GTGCTGCAGA TCGGGCGGCA GCCGCTGTCG
GTCATTCGCG ACCATGTCTC ACGCGTGCTG GTGGTGCCGG AACGCGCGGT CGAGGATGCG
ATCACCCTGA TGGCCGAGGG CGCCAAGCAG GTCAGCGAGG GCGCGGGCGC CAGCGCCCTG
GCGGCGGTCC TGACCTATCC CGAACTGTTC CGGGGCAAGC GGGTGGCGCT GCCGGTGACG
GGCGGCAATA TCGACAGCCG CATCCTGGCC AACACGCTGC TGCGCTCGCT GCTGCGCGAC
GGGCGGCTGC TGTGCCTGAA GATGGAAATC CCGGACCGGC CGGGCGTGCT GGCCGACATC
TCGCGCATGA TCGGCGAGGC CGGCGGAAAC ATCATCGAGG TCTCGCACCA GCGCCTGTTC
ACCGCCGCCA GCGTGCAGGC GGCCGAACTG GAAGTGATGA TCGAAGCCCG CGACCCCACC
CACGCGATGG AGATCATGGG GCAACTGGCC AAGACCTACA TCGTCCGCCG GGTGTAG
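
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be stripped before the sequence is measured or analysed. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The helper names are hypothetical, and only the first displayed line is used as sample input, so the printed GC value describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a whitespace-blocked sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, exactly as displayed in the record.
first_line = "ATGATCACCC TCAACGACGT TCGCGCCGCC GCCGCGCGGA TCGAGGGGCG TGTCCTGCGC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_fraction(seq), 2))
```

Applying the same two functions to all lines of the gene sequence gives a value that can be compared with the Gene Length and GC content fields.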
 
Protein sequence
MITLNDVRAA AARIEGRVLR TPTVPSHALS RATGADIVLK LDNLQAVGSF KERGAANKLA 
LLTPEERARG VITVSAGNHA QGVARHAALL EIDAVIVMPR FTPAAKVTRT AAWGARVVLE
GDNFAEATAH ANALCAREGR VFVHPYDDPE VMAGQGTFAL ELFEDAGPLD IFVGPIGGGG
LLSGCAVVGR ALRPGMDIMG VQVESYSSLS AFPGDEIMPP GGATIAEGIA VLQIGRQPLS
VIRDHVSRVL VVPERAVEDA ITLMAEGAKQ VSEGAGASAL AAVLTYPELF RGKRVALPVT
GGNIDSRILA NTLLRSLLRD GRLLCLKMEI PDRPGVLADI SRMIGEAGGN IIEVSHQRLF
TAASVQAAEL EVMIEARDPT HAMEIMGQLA KTYIVRRV
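
The protein sequence can be cross-checked against the gene sequence by translating the CDS. The sketch below builds a codon table from the standard codon assignments, which NCBI translation table 11 shares with table 1; table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG. The `translate` helper is hypothetical and handles only unambiguous A/C/G/T input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGATCACCC TCAACGACGT TCGCGCCGCC"))  # MITLNDVRAA
```

The output matches the first block of the protein sequence. Translating the full gene sequence the same way gives a string that can be compared with the protein sequence after its spaces are removed.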