Gene Gdia_1803 details


Gene Information

Locus tag: Gdia_1803
Symbol:
ID: 6975225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1996523
End bp: 1997458
Gene Length: 936 bp
Protein Length: 311 aa
Translation table: 11
GC content: 72%
IMG OID: 643391328
Product: putative extracellular solute-binding protein, PotD/PotF family
Protein accession: YP_002276178
Protein GI: 209543949
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
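
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (which matches the listed 936 bp) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1996523, 1997458)
print(length)                     # 936
print(protein_length_aa(length))  # 311
```

Both results agree with the Gene Length and Protein Length fields of this record.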


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.826481
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value: 0.262501
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGTTCT GTCTCACGGT ACTGGTCGGG AACGATGCCC GGGCCGGATG GCGGGCGCAC 
GCCCTGGTTG TCGAAGGCTG GGGCGGCGCC CTGGGCAAGG CGCAGGACCA GGCCTTCTTT
CGTCCCTTCG CCGCCAGTAC GGGGACCGGA ATCCTCCGCT ATGTGTGGGA TGGCGGCAGC
CTGCCCGCTC CGGCGGGCCG GCATGCCTGG GCCCTGGCCC TGGTGGAGGA CAGCACGGCC
CGCATCGCCT GCATGCAGGG CCGCCTGCAG CGCCTGGGCG GCAGCCCCGG CAGCGCGGAT
GCGTGCGGCG TGCCCGCGCT GCATGACGGC ATTGCCCTGG CATGGGACCG TGGCCGCATT
CCGGCCGCCC CGCACTGGAG CGATTTCTGG AACATCGTCC GCTATCCCGG CAAGCGCGGC
CTGCGCAAGG ACCCGCGCTC GACGCTGGAA ATCGCCCTGA TGGCCGACGG CGTGGCGCCG
TCGGACGTCT ATACCGTGCT GGCGACGCCC GAGGGGGTCG ACCGCGCGTT CCACAAGCTC
AGCCAGTTGC GTCCCTATAT CGTCTGGTGG ACCAGCGCCG CGGAATCCGC GCGGATCATA
GGTGACGGCA GCGTGCTGAT GACCAGCGCC GCGGGGGGCG AGGTCGCGGC GTCGGCCAGT
TCCGGCCACC GCGATGTCGG CCTCCAGTGG GCGCAGAGCC TGGATGACGG CCTGTCCTGG
GGTGTCGCAC CGGGGCTGGA CAGCACGGTT CGCGACCGGG CCCGCGCCCT GCTGCATTAT
ATGTCCCAGC CCGAGCAGAT CGCGCGCTTC GCCGGCCTGT ACCACGCCCG CCCCGACGAT
CCGTCGCTGC AGCCGATGCC GATGGACGCC GCGTTCTGGC AGGCCCATCT GCCCGGGTTG
GCCAAGCGGT TCGCGGACTG GCTGGCGACG CCGTGA
 
Protein sequence
MLFCLTVLVG NDARAGWRAH ALVVEGWGGA LGKAQDQAFF RPFAASTGTG ILRYVWDGGS 
LPAPAGRHAW ALALVEDSTA RIACMQGRLQ RLGGSPGSAD ACGVPALHDG IALAWDRGRI
PAAPHWSDFW NIVRYPGKRG LRKDPRSTLE IALMADGVAP SDVYTVLATP EGVDRAFHKL
SQLRPYIVWW TSAAESARII GDGSVLMTSA AGGEVAASAS SGHRDVGLQW AQSLDDGLSW
GVAPGLDSTV RDRARALLHY MSQPEQIARF AGLYHARPDD PSLQPMPMDA AFWQAHLPGL
AKRFADWLAT P
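
The gene and protein sequences above can be cross-checked with a short script. This is a sketch only, under two assumptions: the codon assignments are the standard ones (which translation table 11 shares for elongation codons), and the first codon is always read as methionine, as the listed protein does for its GTG start. The helper names `clean`, `gc_fraction` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate a CDS up to the first stop codon.

    Simplifying assumption: the first codon is emitted as M whatever its
    sequence, mirroring how bacterial start codons such as GTG are read.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    if residues:
        residues[0] = "M"
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("GTGCTGTTCT GTCTCACGGT ACTGGTCGGG"))  # MLFCLTVLVG
```

The printed residues match the first ten residues of the protein sequence in this record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.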