Gene Gdia_0194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0194 
Symbol 
ID6973586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp208856 
End bp210106 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID643389726 
Productcystathionine beta-lyase 
Protein accessionYP_002274607 
Protein GI209542378 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01324] cystathionine beta-lyase, bacterial 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.212161 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.190528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC GTCAGGCCCT GCATCATGCC GTCTCCCGCG GCTGGCGCCA TATGGGCACC 
ACGCTGGTCC ATCTGGCGCG CGACCTGCCG CCGGCGGCGG AGGGCACGCT GGTCAACCCG
GCCTCCACGC GCGGATCGAC GGTGCTGTTC CCCACCGTCG CGGACATGAA CCGCAACGGC
ACGCGGCGCT ATGATCACGA ACTGATCTAT GGTGCCATGG GCACGCCCAT CCAGCACCAG
CTGGAATCCG CGATCGCGGC GCTGGAGGGT GCGCGCCATA CGCAGGTGGT CTCGTCCGGC
CTGGCCGCGT GCTCCACGCC GCTGCTGGCC TTCCTGGGCA AGAATGGCCA TTGCCTGATC
CCGGATTCGG TCTATGGCCC GACCCGCCGC TTCGCCAATA CCGTCCTGCG CCGCTTCGGG
GTCGAGACGA CCTATTACCC CCCCATGATC GACGCGGACG GGCTGCGCGC CGCCATGCGG
CCGAACACGC AGGTCGTCTT CGCCGAAAGC CCGGGCAGCC ATACGTTCGA GGTCCAGGAC
ATCCGCATGA TCGCGGACAT CGCGCACGAG CACGGCGCGC GGATGATGCT GGACAATACC
TGGGGCATCG GCGTGTTCCA GCCGTTCGAC CATGGGGTCG ATATCTCGAT CCAGGCGCTG
ACCAAATATC CGGCCGGCCA TTCGGACAGC ATCATCGGCG CGGTATCCGT CGCCGACGAA
CAGGACTGGC AGGCCCTGCG CGATACCAGC ATCCAACTGG GGCAGGTGGC GGGCCCCGAC
GATTGCTGGC TGACCCTGCG CGGGCTGCGG ACCATGGGGG CGCGGCTGGA GCGGCAATCC
CGCGCCGCGA TCGACATCGC CCTGTGGCTG TCCGACCGGC CCGAGGTGGC GCGCGTGCTG
CATCCCGCCC TGCCGTCCTG TCCGGGACAC GAGATATGGC GGCGCGATTA TACCGGGGCG
TCGGCGCTGT TCGGCGTGGT CTTCCAGCCG GAATACGACG CGGCCGCGAT GACGGCGATG
ATCGATTCCC TGGCCCTGTT CGGAATCGGT GCCTCCTGGG GCGGATATGA AAGCCTGGTC
CTGCCGACCA CGGGCGGCAT TACCCGGTCC TGCCCGCCCG GCGAGGAATG CGGCCCGCAG
ACCGGCCCGG CCTGCCGCCT GCATATCGGA CTGGAAAACC CCGAGGACCT GATCGCGGAC
CTGTCGGCCG GGCTGGAGGT GCTGGCCGGC GGGGCCGTGT ACGGGGCATA G
 
Protein sequence
MTDRQALHHA VSRGWRHMGT TLVHLARDLP PAAEGTLVNP ASTRGSTVLF PTVADMNRNG 
TRRYDHELIY GAMGTPIQHQ LESAIAALEG ARHTQVVSSG LAACSTPLLA FLGKNGHCLI
PDSVYGPTRR FANTVLRRFG VETTYYPPMI DADGLRAAMR PNTQVVFAES PGSHTFEVQD
IRMIADIAHE HGARMMLDNT WGIGVFQPFD HGVDISIQAL TKYPAGHSDS IIGAVSVADE
QDWQALRDTS IQLGQVAGPD DCWLTLRGLR TMGARLERQS RAAIDIALWL SDRPEVARVL
HPALPSCPGH EIWRRDYTGA SALFGVVFQP EYDAAAMTAM IDSLALFGIG ASWGGYESLV
LPTTGGITRS CPPGEECGPQ TGPACRLHIG LENPEDLIAD LSAGLEVLAG GAVYGA