Gene Gdia_0844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0844 
Symbol 
ID6974241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp959836 
End bp960846 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content65% 
IMG OID643390373 
Producthypothetical protein 
Protein accessionYP_002275249 
Protein GI209543020 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.655414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0599894 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACCG CCGCCCTTGG CGCCTCGATG GCGGCGCTGG GCGGCATCCG GCGCGCCGCC 
GCTAAAACCG TCAGCCATCC CCGATTCCGT CTGGTTTTCG TCAACCATGT CACGACCAAT
CCGTTTTTCA CGGCGACACA GTATGGACTG CGCGACGCGG CGGCCCTGGT CGGGGCGGAT
ACCCAGTGGA CGGGGTCGGA AAACAGCATC GCGGCCGAGA TGATCACCGC GATCAATGCC
GCCATCGCCG CCAAGGCCAG CGCGATCGCC GTCTGCCTGG TCGATCCGCA TGCCTTCAAC
GATCCGGTCG AACGCGCGCT GGCCGCCGGA ATCCCGGTCT TCGCCTATAA CGCGGACGCG
CCGGCGGGGT CGGGCAACAA GCGCCTGGCC TATATCGGGC AGGATCTGTT CAAGGCCGGG
CAAATGATGG GCCAGCGGAT TCTCGACCTG GTGCCCGGCG GGCGCGTGGC GCTGATGATC
GCGACACCCG GGCAGTTGAA CATCCAGCCG CGTATCGACG GCGCGCAGGA CATGCTGCGC
AAGAGCGGCC GGTCGTACCA GATCGACATC GTCGCGACGG GCGCCACGGT GAACGAGGAA
CTGTCCAAGG TGAAGGCCTA TTACCTGGGC CATTCGGACG TGAAGGGAAT GTTCGCCGTC
GATGGCGGCA CCACGCAGTC CGTCGCCGAC ACGATGGCGC AGTACGGCCT GGCCGCCAAG
GGCGTGCGGG GCGGCGGCTT CGACCTGCTG CCGCGCACGC TGCGCCTGAT CAATGACGGG
CACCTGGATT TCACCATCGA CCAGCAACCC TATCTGCAGG GCTACTACAC GGTCATGGAA
ATGTACACCT ACCTGATGTC CGGGGGCCTG GTGGGACCGG CGGAAATCAA TACCGGCCTG
AAATTCGTGA CCAAAGGGGA TGTGGCGCCA TACCTGGCGA CCAAGAGCCG CTACGAGGGC
AGTTCGAGCG AGGCGCAGTT CATCCCGCGC TCCGGCCCGA TCCAGTCCTA G
 
Protein sequence
MQTAALGASM AALGGIRRAA AKTVSHPRFR LVFVNHVTTN PFFTATQYGL RDAAALVGAD 
TQWTGSENSI AAEMITAINA AIAAKASAIA VCLVDPHAFN DPVERALAAG IPVFAYNADA
PAGSGNKRLA YIGQDLFKAG QMMGQRILDL VPGGRVALMI ATPGQLNIQP RIDGAQDMLR
KSGRSYQIDI VATGATVNEE LSKVKAYYLG HSDVKGMFAV DGGTTQSVAD TMAQYGLAAK
GVRGGGFDLL PRTLRLINDG HLDFTIDQQP YLQGYYTVME MYTYLMSGGL VGPAEINTGL
KFVTKGDVAP YLATKSRYEG SSSEAQFIPR SGPIQS