Gene Gdia_1531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1531 
Symbol 
ID6974941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1706735 
End bp1708489 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content64% 
IMG OID643391062 
ProductLevansucrase 
Protein accessionYP_002275925 
Protein GI209543696 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.515752 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.219322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCATG TACGCCGAAA AGTAGCCACG CTGAATATGG CGTTGGCCGG GTCCCTGCTC 
ATGGTGCTGG GCGCGCAAAG TGCGCTGGCG CAAGGGAATT TCAGCCGGCA GGAAGCCGCG
CGCATGGCGC ACCGTCCGGG TGTGATGCCT CGTGGCGGCC CGCTCTTCCC CGGGCGGTCG
CTGGCCGGGG TGCCGGGCTT CCCGCTGCCC AGCATTCATA CGCAGCAGGC GTATGACCCG
CAGTCGGACT TTACCGCCCG CTGGACACGT GCCGACGCAT TGCAGATCAA GGCGCATTCG
GATGCGACGG TCGCGGCCGG GCAGAATTCC CTGCCGGCGC AACTGACCAT GCCGAACATC
CCGGCGGACT TCCCGGTGAT CAATCCGGAT GTCTGGGTCT GGGATACCTG GACCCTGATC
GACAAGCACG CCGATCAGTT CAGCTATAAC GGCTGGGAAG TCATTTTCTG CCTGACGGCC
GACCCCAATG CCGGATACGG TTTCGACGAC CGCCACGTGC ATGCCCGCAT CGGCTTCTTC
TATCGTCGCG CGGGTATTCC CGCCAGCCGG CGGCCGGTGA ATGGCGGCTG GACCTATGGC
GGCCATCTCT TCCCCGACGG AGCCAGCGCG CAGGTCTACG CCGGCCAGAC CTACACGAAC
CAGGCGGAAT GGTCCGGTTC GTCGCGTCTG ATGCAGATAC ATGGCAATAC CGTATCGGTC
TTCTATACCG ACGTGGCGTT CAACCGTGAC GCCAACGCCA ACAACATCAC CCCGCCGCAG
GCCATCATCA CCCAGACCCT GGGGCGGATC CACGCCGACT TCAACCATGT CTGGTTCACG
GGCTTCACCG CCCACACGCC GCTGCTGCAG CCCGACGGCG TGCTGTATCA GAACGGTGCG
CAGAACGAAT TCTTCAATTT CCGCGATCCG TTCACCTTCG AGGACCCGAA GCATCCCGGC
GTGAACTACA TGGTGTTCGA GGGCAATACC GCGGGCCAGC GTGGCGTCGC CAACTGCACC
GAGGCCGATC TGGGCTTCCG CCCGAACGAT CCCAATGCGG AAACCCTGCA GGAAGTCCTG
GATAGCGGGG CCTATTACCA GAAGGCCAAT ATCGGCCTGG CCATCGCCAC GGATTCGACC
CTGTCGAAAT GGAAGTTCCT GTCGCCGCTG ATTTCGGCCA ACTGCGTCAA TGACCAGACC
GAACGGCCGC AGGTGTACCT CCATAACGGA AAATACTATA TCTTCACCAT CAGCCACCGC
ACGACCTTCG CGGCCGGTGT CGATGGACCG GACGGCGTCT ACGGCTTCGT GGGTGACGGC
ATCCGCAGTG ACTTCCAGCC GATGAACTAT GGCAGCGGCC TGACGATGGG CAATCCGACC
GACCTCAACA CGGCGGCAGG CACGGATTTC GATCCCAGCC CGGACCAGAA CCCGCGGGCC
TTCCAGTCCT ATTCGCACTA CGTCATGCCG GGGGGACTGG TTGAATCGTT CATCGACACG
GTGGAAAACC GTCGCGGGGG TACCCTGGCG CCCACGGTCC GGGTGCGCAT CGCCCAGAAC
GCGTCCGCGG TCGACCTGCG GTACGGCAAT GGCGGCCTGG GCGGCTATGG CGATATTCCG
GCCAACCGCG CGGACGTGAA CATCGCCGGC TTCATCCAGG ATCTGTTCGG CCAGCCCACG
TCGGGTCTGG CGGCGCAGGC GTCCACCAAC AATGCCCAGG TGCTGGCGCA GGTTCGCCAA
TTCCTGAACC AGTAA
 
Protein sequence
MAHVRRKVAT LNMALAGSLL MVLGAQSALA QGNFSRQEAA RMAHRPGVMP RGGPLFPGRS 
LAGVPGFPLP SIHTQQAYDP QSDFTARWTR ADALQIKAHS DATVAAGQNS LPAQLTMPNI
PADFPVINPD VWVWDTWTLI DKHADQFSYN GWEVIFCLTA DPNAGYGFDD RHVHARIGFF
YRRAGIPASR RPVNGGWTYG GHLFPDGASA QVYAGQTYTN QAEWSGSSRL MQIHGNTVSV
FYTDVAFNRD ANANNITPPQ AIITQTLGRI HADFNHVWFT GFTAHTPLLQ PDGVLYQNGA
QNEFFNFRDP FTFEDPKHPG VNYMVFEGNT AGQRGVANCT EADLGFRPND PNAETLQEVL
DSGAYYQKAN IGLAIATDST LSKWKFLSPL ISANCVNDQT ERPQVYLHNG KYYIFTISHR
TTFAAGVDGP DGVYGFVGDG IRSDFQPMNY GSGLTMGNPT DLNTAAGTDF DPSPDQNPRA
FQSYSHYVMP GGLVESFIDT VENRRGGTLA PTVRVRIAQN ASAVDLRYGN GGLGGYGDIP
ANRADVNIAG FIQDLFGQPT SGLAAQASTN NAQVLAQVRQ FLNQ