Gene Gdia_2110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2110 
Symbol 
ID6975537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2336991 
End bp2338019 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content66% 
IMG OID643391639 
Productaspartate-semialdehyde dehydrogenase 
Protein accessionYP_002276484 
Protein GI209544255 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR01296] aspartate-semialdehyde dehydrogenase (peptidoglycan organisms) 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGATACC GCGTTGCGGT CGTCGGCGCC ACGGGCGCCG TCGGGCGGGA AATGCTCAAG 
ACGCTGGCCG AACGCGCGTT TCCGATAGAC GAGATCGTCG CCCTGGCCTC GGCCCGATCG
GCGGGGCAGG AGGTCTCGTT CGGCGACAAG ACGGTGCTGA AGGTGCAGAA CCTGGAGCAT
TTCGACTTCA CCGGCTGGGA CATCGCCCTG TTCTCGCCCG GGGCGTCGGT TTCGGCGGTG
CATGCGCCGC GCGCGGCCAA GGCCGGATGC ATCGTCATCG ACAACACGTC GCATTTCCGC
ATGGAACACG ACGTGCCGCT GGTGGTGCCC GAGGTCAACC CGAACGCGCT GAAGCGGGCG
CGGCGCGGCA TCATCGCCAA CCCGAACTGC TCGACCATCC AGATGGTGGT GGCGCTGAAG
CCGCTGCACG ACCTGTTCAC CATCCGCCGC GTCGTGGTGG CCACGTACCA GGCGGTGGCC
GGCGCGGGCA AGGAAGGCAT GGACGAACTG TTCGCCCAGT CGCGCGCCAG CTTCGTGGGC
GACCCGCTGA AGGCCGAACA GTTCACCAAG CAGATCGCCT TCAACTGCAT TCCCCATATC
GACCGTTTCA TGGATGACGG CGCGACCAAG GAGGAATGGA AGATGACGGC CGAGACCCGC
AAAATCCTTG ACCCTGACAT CTCGGTTTTC GCTACCTGCG TGCGCGTGCC GGTCTTCATC
GGCCATTCCG AGGCCATCAC GGTCGAGTTC GAGGAACCCG TGGACCTGGA GCGCGCGCGG
GAGGCCCTGC GCGAGGCGCC GGGCGTCATC CTGCACGACC AGCGCGAGGA TGGCGGCTAC
GTCACGCCGA CAGAATGTGT TGGTGAGGAC GCAACTTACG TGTCGCGCCT GCGGATCGAC
CCGACCGTAC CCAACGGCCT GGCCTTCTGG TGCGTGGCGG ACAATCTGCG CAAGGGGGCC
GCGCTGAATG CAGTACAGAT CGCGGAAACC ATGATCGCGC TGGACCTGAT TCACCACAAG
GCAGCCTGA
 
Protein sequence
MGYRVAVVGA TGAVGREMLK TLAERAFPID EIVALASARS AGQEVSFGDK TVLKVQNLEH 
FDFTGWDIAL FSPGASVSAV HAPRAAKAGC IVIDNTSHFR MEHDVPLVVP EVNPNALKRA
RRGIIANPNC STIQMVVALK PLHDLFTIRR VVVATYQAVA GAGKEGMDEL FAQSRASFVG
DPLKAEQFTK QIAFNCIPHI DRFMDDGATK EEWKMTAETR KILDPDISVF ATCVRVPVFI
GHSEAITVEF EEPVDLERAR EALREAPGVI LHDQREDGGY VTPTECVGED ATYVSRLRID
PTVPNGLAFW CVADNLRKGA ALNAVQIAET MIALDLIHHK AA