Gene Gdia_1236 details

Gene Information

Locus tag: Gdia_1236
Symbol:
ID: 6974641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1376093
End bp: 1377295
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 68%
IMG OID: 643390766
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_002275634
Protein GI: 209543405
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
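
The coordinates and lengths in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1376093
end_bp = 1377295

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1203 400"
```

Under these assumptions the result agrees with the listed gene length of 1203 bp and protein length of 400 aa.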


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.43898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 55
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG ATTCGACCGA CCGAGCCTAC CGCCCCGCGA CCCGCCTGCT GCATTCCGGC 
GTCGAGCGCA CCCCGTTCGG CGAGACGAGC GAGGCGATGT TCCTGACATC GGGCTTCGTC
TACGACAATG CCGAACAGGC CGAGGCGACC TTCACCGGCG ACGTGACGCA TTATCAGTAC
AGCCGGTTCG GCAACCCCAC CGTCGAGGCG CTGGAGAAGC GCCTGGCGGA CCTGGAAGGG
GCCGAGGCCT GCATCGCGAC CTCGACGGGC ATGGGGGCGG TGTCGTCCGC GCTGCTGTCG
CACGTCAAGG CCGGGGACCG GGTGGTGGCG TCGCGCGCGC TGTTCGGGTC GTGCCACTGG
ATCGTCGCCA ACCTGCTGCC GCGCTACGGG GTGGAAACGG TGTTCGTGGA CGGGGGCGAC
ATGGACGCGT GGGCGGAGGC CCTGGCGCGG CCCACGGCGG CGGTCCTGCT GGAAAGCCCG
TCGAACCCGA TGCTGGACAT CCTGGACATC CGCGCCATTT CCGACCTGGC GCACCAGGCC
GGAGCCCTGG TGGTGGTCGA CAACGTCTTC GCGACCCCAC TGCTGCAGAA GCCGCTGGAA
CTGGGGGCGG ATGTCGTCGT GTATTCCTGC ACCAAGCATA TCGACGGCCA GGGGCGTGTC
CTGGGCGGCG CGGTGCTGGG CCGCAAGGAC TGGATCACCG ATACCCTGCA GCCCTTCACC
CGCAATACCG GCAACGCCCT GTCGCCGTTC AATGCCTGGG TGATGCTGAA GGGGCTGGAG
ACGCTGGCCC TGCGCGTCCG GGCAATGACC GACAATGCCG CCGCCGTCGC CGATCACCTG
GCCGGCGCCG AGGGGGTGAC GCGGGTGTTC TATCCCGGCC GGCCCGACCA TCCGCAATAC
GCGCTGGCGC AGGCGCAGAT GAGCGGCGCC TCCACCCTGG TCGCCTTCGA GGTCGCGGGC
GGCAAGGCGC GCGCGTTCGC CTTCATGAAC GCGTTGCGGC TGATCGCGAT TTCCAACAAT
CTGGGTGATG CGCGATCGAT GGTGACGCAC CCGGCCACCA CCACGCACAT GAAGATCGGC
GCCGAGGAAC GGGCGCGACT GGGCATCACC GACGGCGTGA TCCGCTTTTC GGTGGGTCTG
GAAGACAGCG CCGATCTGAA GGATGATCTG GATCGCGGGC TGGCCGCCCT TCGGTCACGC
TGA
 
Protein sequence
MSNDSTDRAY RPATRLLHSG VERTPFGETS EAMFLTSGFV YDNAEQAEAT FTGDVTHYQY 
SRFGNPTVEA LEKRLADLEG AEACIATSTG MGAVSSALLS HVKAGDRVVA SRALFGSCHW
IVANLLPRYG VETVFVDGGD MDAWAEALAR PTAAVLLESP SNPMLDILDI RAISDLAHQA
GALVVVDNVF ATPLLQKPLE LGADVVVYSC TKHIDGQGRV LGGAVLGRKD WITDTLQPFT
RNTGNALSPF NAWVMLKGLE TLALRVRAMT DNAAAVADHL AGAEGVTRVF YPGRPDHPQY
ALAQAQMSGA STLVAFEVAG GKARAFAFMN ALRLIAISNN LGDARSMVTH PATTTHMKIG
AEERARLGIT DGVIRFSVGL EDSADLKDDL DRGLAALRSR
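
The gene sequence is printed in space-separated blocks of ten bases, so any check of the listed GC content has to strip the whitespace first. The following is a minimal sketch of that step; `gc_fraction` is a hypothetical helper, not something the record or its source database provides, and the example input is only the first line of the gene sequence rather than the full gene.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases in a whitespace-blocked DNA string."""
    # Remove the spaces and line breaks used to lay out the sequence.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied as printed (blocks of ten bases).
first_line = "ATGAGCAACG ATTCGACCGA CCGAGCCTAC CGCCCCGCGA CCCGCCTGCT GCATTCCGGC"
print(round(gc_fraction(first_line), 2))
```

Passing the complete gene sequence (all lines joined) to the same function would give a value to compare against the 68% GC content listed in the record.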