Gene Gdia_2014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2014 
Symbol 
ID6975440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2235257 
End bp2236312 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content70% 
IMG OID643391543 
ProductThreonine aldolase 
Protein accessionYP_002276389 
Protein GI209544160 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.316829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTAGAGC AGATACGCAA AAATTTCAGC AGCGACAACG TCGTTCCGGC CTGCCCTGCC 
GTGATGTCCG CCCTGATGGC GGCCAACGAG GGCGCCGCCC CCGCCTACGG CGCCGACGCC
TGGACCGCGC GCCTGCAGCA GGTGGCGGCG GATGTCTTCC AGCACGCGGT CCAGGTCTTT
CCCGTCACCA CCGGCACGGC GGCGAACGCG CTGGCGCTGG CGGCGATAAC CCCACCCTAT
GGCGCGGTCC TGTGCGACGA GAGCGCGCAT ATCGTCCAGT CCGAATGCGG GGCGCCCGAT
TTCTATACCG GCGGCGCGCG GCTGCTGACG ATCCCGTCCG AAGACGGGCG CATGGACCCC
GCTGCCCTGT CCTATGTGCT GGACCGCCAC CCGGCTTCCA ACGTGCAGGA CAACCTGCCG
ACGACGCTGA GCCTGACGCA GGCCACGGAA TGGGGCACCG TCTACGACCC GGCGCGGATC
GCGGACCTGA CGGCGCGGGC GCGCGCCCGG GGCCTGGCGG TGCATCTGGA CGGCGCACGG
CTGGCCAACG CCATCGTCCA TCTGGGATGC ACGCCCGCCG AGGCGACATG GAAGGCCGGG
ATCGACGTGC TGGCGCTGGG CGCGACCAAG AACGGCGCGA TGGCGGCCGA AGCCGTGATC
ATCTTCGACC CCGCGCGGGC CGAGCAGTTC GCCCGGCGCC GCAAGCGCGG CGGCCATGGC
TGGTCCAAGC AGCGCTTCCT CAGCGCACAG TTGCTGGCCT GCCTGGAAGA CGATCTGTGG
CTGAACAACG CGCGGCAGGC CAACGCCATG GCGCACCGGC TGGCAGGCGG CCTGTTCCGC
CACCCCGGCG CCCGCCTGGT CTATGAAACC CAGGCCAACG AGATCTTCGT CATGCTGCCC
GACCGGGCGA TCGCGCACCT GCGCGCGGCA GGGTTCGTCT TCCGCGACTG GCCCACCCCG
CTGGGCGTGG AGGGGACCGT CGTGCGGCTG GTCACCAGTT ATTATACGCG CGTGGCGGAT
GTGGACGCGT TCCTGGCGAC CCTGGCGGAA GTATAA
 
Protein sequence
MVEQIRKNFS SDNVVPACPA VMSALMAANE GAAPAYGADA WTARLQQVAA DVFQHAVQVF 
PVTTGTAANA LALAAITPPY GAVLCDESAH IVQSECGAPD FYTGGARLLT IPSEDGRMDP
AALSYVLDRH PASNVQDNLP TTLSLTQATE WGTVYDPARI ADLTARARAR GLAVHLDGAR
LANAIVHLGC TPAEATWKAG IDVLALGATK NGAMAAEAVI IFDPARAEQF ARRRKRGGHG
WSKQRFLSAQ LLACLEDDLW LNNARQANAM AHRLAGGLFR HPGARLVYET QANEIFVMLP
DRAIAHLRAA GFVFRDWPTP LGVEGTVVRL VTSYYTRVAD VDAFLATLAE V