Gene Gdia_0631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0631 
Symbol 
ID6974028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp708840 
End bp710030 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content67% 
IMG OID643390162 
ProductInosine/uridine-preferring nucleoside hydrolase 
Protein accessionYP_002275038 
Protein GI209542809 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1957] Inosine-uridine nucleoside N-ribohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0417904 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTCT GCCCTCTCCG GCGCGCGATG CGCCTCGGGA CGACCCGACG CGAATCGACG 
CCGCGCGAAG TGCCGCGCCG CGCCTTCCGC CGCGCCCTGG GCGCCGGCGT GCTCGGCCTG
CTGGCCGCCT GGCTGACGGC CGGGCCGCCC GCGCGGGCGG CCGCCCCGCA ACTGGTGATC
GAGGACAATG ATTTCCTCGG TCCCGGCGGA TCGGACCAGC AATCGATCAT TCCGCTGCTG
TTCAATCCGG CGGTCCGGGT CCTGGGCTTC ACCGTCGTGA CCGGCGATGG CTGGGAAAAC
CAGGAATCCG CCCATCTGCG CCGGTTCCTG GACATTATCG GCCGGGGCGA CGTGCCGGTG
GCGGACGGCG CGGTCTATCC GCTGATCAAC AGCGTGCCGA TGATGCGCCT GCGCGAACAG
CAATACGGCG TCATCCCGTG GAAGGGCGCC TGGGGCGGAA TCGGGTCGAT CGACGGCACG
CCCGCGACCC AGCCCCCCTT GCCGGACCTG AAGGAGGGCG CCCCGGCACA GCGTCCGATC
GATGACAGTG CGGCCCTGTT CCTGATCCGT CAGGTCCACA ATCACCCCCA TCAGGTCACC
ATCGTGGCCG CCGGGCCGCT GACCAACCTG GCGCTGGCGA TCCGCATCGA CCCGACCTTC
GCCGCGACGG CGAAACAACT GGTGTTCATG GGCGGATTGC TGGACACCAG CATGAAATCG
CTGACCGGGA ACGTCGATTT CGCCTCGGAC TTCAACATGA TCTTCGACCC GGAGGCGGCG
CATATCACGC TGACGGCACC GTGGCCGTCG ATCACCGTCG TGGGCAACGT GTCGAACGAC
CTGATGATGA CGCGCGACTA CCTGAAGCGC ATCACGCAGA AGAAGACGAA AATGACGGAC
TATATCGGCA CCTATTACGA CCCGCTGCCG ATGTGGGACG AACTGGCGAC GGCGATCGCC
GCCGACCCCG GCCTGATCAC GTCCTCGATC GACGCCTATA TGGACATCGA CATTTCCAAG
GGGCCGCAAT ACGGCCACGC CGTCGTCTGG CCGGATGCCA CCGCGCCCAG GGCGATGGGC
GTCCGCAAGG TGCGGATCGT GCAGTCCGTG GATGCGGGAC GCTTCCTCGC CACGTTCCTG
CACGAGGCCC AGAGCGACAT CCATCCGTCC CATGCCTTCC CCCGGCCCTG A
 
Protein sequence
MFFCPLRRAM RLGTTRREST PREVPRRAFR RALGAGVLGL LAAWLTAGPP ARAAAPQLVI 
EDNDFLGPGG SDQQSIIPLL FNPAVRVLGF TVVTGDGWEN QESAHLRRFL DIIGRGDVPV
ADGAVYPLIN SVPMMRLREQ QYGVIPWKGA WGGIGSIDGT PATQPPLPDL KEGAPAQRPI
DDSAALFLIR QVHNHPHQVT IVAAGPLTNL ALAIRIDPTF AATAKQLVFM GGLLDTSMKS
LTGNVDFASD FNMIFDPEAA HITLTAPWPS ITVVGNVSND LMMTRDYLKR ITQKKTKMTD
YIGTYYDPLP MWDELATAIA ADPGLITSSI DAYMDIDISK GPQYGHAVVW PDATAPRAMG
VRKVRIVQSV DAGRFLATFL HEAQSDIHPS HAFPRP