Gene Gdia_0678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0678 
Symbol 
ID6974075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp767841 
End bp768929 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content66% 
IMG OID643390207 
ProductThreonine aldolase 
Protein accessionYP_002275083 
Protein GI209542854 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.000267715 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATCCTCG CGATGCGCGA CATCGGACGA GGAAAGGCCT TTACCGCAAT GCCCCAGACG 
CAGCAGTTCG CCAGCGACAA TTACGCCGGT ATCTGTCCCG AAGCCCTGCG TGCGATGGAA
GACGCCAATC GCGGGTCCGC CCAGGCCTAT GGTGACGATC CCTGGACGGT CCGCGCGGCC
GATGCGTTTC GCGCGACCTT CGAGACCGAT TGCGAGCTGT TCTTCGTCTT CAACGGCACG
GCCGCCAATT CGCTGGCCCT GGCCGCGCTG TGCCAGTCGT ACGAAAGCAT CATCGCCGCC
GACATCGCCC ATGTCGAAAC CGACGAATGC GGGGCGCCGG AATTCTTTTC CAACGGCGCC
AAGATCCTGG TCGGGCGGAC CGAAAACGGC AAGCTGACGC CGGAGACGAT CCGCGGCTTC
GCGCGGGGAC GCACCGACAT CCATTTCCCG CGTCCGCGCG CCGTCACCAT CACCCAGCCC
ACCGAGACCG GCCAGGTCTA CAGCCTGGAG GAAATCCGCG CCGTCGCCGC CACCTGCCGC
GAACTGGGGC TGAAGCTGCA TATGGATGGC GCGCGATTCG CCAATGCCGT CGAAAGCCTG
GGCTGCACCC CGGCCGACAT GACGTGGCGT TCCGGGGTGG ACGTGCTGAG CTTCGGCGGC
ACCAAGAACG GCATGGCGAT CGGCGAGGCG GTGATCTTCT TCAACCGCTC CCTGGCCGAA
GGGTTCGATT ATCGCTGCAA GCAGGCCGGC CAACTGGCAT CCAAGATGCG CTTCCTGTCC
GCGCCCTGGG CCGTGATGTT CGAAACCGGC GCATGGCGCG CGAATGCCGC CCATGCCAAT
GCCTGCGCCC GTCACTTCGC CGATCAGGTC GCGGACCTGC CGGGGGTGGA GGTGCAGTTT
CCCGTCGAGG CGAACGCGGT ATTCCTGAAG CTGCCCCCCG AGACGACCGC GGCGCTGCGG
GCGCGCGGCT GGCTGTTCTA TACCTTCATC GGTGACAGTG CCCGCTTCAT GTTCGCCTGG
GATTCGGAGC GTGCGCGCAT CGACGCGCTG GCGGCGGACC TGCGCGCCAT CACCCGCACG
CCGGCATGA
 
Protein sequence
MILAMRDIGR GKAFTAMPQT QQFASDNYAG ICPEALRAME DANRGSAQAY GDDPWTVRAA 
DAFRATFETD CELFFVFNGT AANSLALAAL CQSYESIIAA DIAHVETDEC GAPEFFSNGA
KILVGRTENG KLTPETIRGF ARGRTDIHFP RPRAVTITQP TETGQVYSLE EIRAVAATCR
ELGLKLHMDG ARFANAVESL GCTPADMTWR SGVDVLSFGG TKNGMAIGEA VIFFNRSLAE
GFDYRCKQAG QLASKMRFLS APWAVMFETG AWRANAAHAN ACARHFADQV ADLPGVEVQF
PVEANAVFLK LPPETTAALR ARGWLFYTFI GDSARFMFAW DSERARIDAL AADLRAITRT
PA