Gene Gdia_0248 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0248 
Symbol 
ID6973640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp272471 
End bp274465 
Gene Length1995 bp 
Protein Length664 aa 
Translation table11 
GC content70% 
IMG OID643389779 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_002274660 
Protein GI209542431 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.250562 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGGGA CAATGGCCTC GCAACGCCCT GATCCCCCTC TGCCGGCCGA CAGGCGGCGC 
CCGGTCAACC GCATCGCCGT CAATGTCCTG CTGGACGGCC TTCTCGCCGC CCTGTCGGCG
CCGCTGGCAC GCTGGCTGGC CGACCCGCGC GGCGGCCTGC TGCATCCGCT CTGGTTCGTC
GCGGGCGGCG CCATCACCCT GATCATCAGC GGCCTGCCGT TTCGTGTGCA GCAGCAATAT
TGGCGGTTTT CGGGGGTGGG CGACCTGCTG AACATCGCGG GTGCCTCGGT GGCCAGCGCC
ATCCTGTTCG CGCTGGGCCT GCATGTCACC GGCTTTCCGA TCCCGACCCC GACCTTTCCC
ATCATCCATG CCCTGGTGCT GCTGACGCTG ATGGGCGGGA TACGGATCAT CTACCGGCTA
TCCTATCGCC GCACGGCGCG GCGGCAGGCA TCCAGCCAGG TGATCCTGGT CGGCTCGGAC
AATTCGGCCG ACCTGTTCAT CCGCGCGGTC GAACGCGCGC CCGATCCCGC CTTTCGCGTG
GCCGGCATCG TCACCCAGGG GCGGCGGCAG GCCGGGCGGC GCATCCACGG CATTCCCATC
CTGGGCCATG TCGAGGGAAT GGACGACATC CTGGCCCGCC TGGACGGGCA TGGCGGCCGG
CCCGAGCGGC TGGTCATCAC CGATGCCGCG TTCCGGGGCG AGGACCTGGC CGAGATCATG
CAGGTGGCCG AGCGGCGGGG ACTGACCGTC CTGCGGGCGC CGGTCCTGAC CGACCTGACG
CCCGCCGACC GGCTGGAACT GCGCCCCGTC GCGATCGAGG ACCTGCTGAA CCGTCCCCAG
GTCGGGCTGG ACCATGACGG CATGGCGGCC CTGCTGGGCG GCCGCACCGT CTTGGTGACC
GGGGCCGGCG GCTCGATCGG GTCGGAACTG GCGCGGCAGA TCGCGGAATA CGAGCCCGGC
CGCCTGCTGC TGCTGGACCA TGGCGAATTC GCGCTGTGGC AGATCGACGT GGAACTGTCG
GAACGCGCGC CCCGGCTGCG GCGCGAGACG ATCCTGGCCG ATATCCGCGA CCGGGCGCGG
ATCGAACAGG TCTGCGCGCA GTACCGTCCC GAACTGGTGT TCCACGCGGC GGCACTGAAA
CACGTGCCGA TGGTCGAGGC CAATCCGTGC GAGGGGGTGC TGACGAACGT CATCGGCACC
CGGATCGTCG CGGACGCGGC GGCGCGGCAT GGCGCCCGCG CCCTGGTCAT GGTCTCGACG
GACAAGGCGG TGAATCCGTC CAGCCTGATG GGGGCCTCCA AGCGCGCGGC CGAGATGTAT
TGCCAGGCCC TGGATGTCGA GGCCCGCACC GGCGACGAAT CCGGCGTGAT GCGGTGCGTC
ACCGTACGGT TCGGCAACGT CCTGGGTTCG ACGGGATCGG TGGTGCCCCT GTTCCGCCGG
CAACTGGAAC GGGGCGGGCC GCTGACGGTT ACCCACCCCG ACATGCAGCG CTATTTCATG
ACGGTGTCCG AGGCCGTGGG GCTGGTGCTG CAGGCCAGCG TGCGCGGCAC CATCCGCGCC
CGCGCGGGCA GCGGCACGGA CACGCTGCTG CGCAGCGGCG GCATCTTCGT CCTGGACATG
GGCAAGCCGA TCCGGATCGT CGACCTGGCG CGGCAGATGA TCCGCCTGGC CGGCCTGCGC
CCCGACCAGG ACGTGCCGAT CCGCTTCACC GGCCTGCGTC CGGGGGAAAA GCTGTTCGAG
GAACTGTTCC ATGGGCGCGA GGCCCCGGTC GCGACCGACC ATCCGGGCCT GAAGATGGCG
ACCCCCCGCA TGGTGGACCG TGGGGCCGTC GGCCTGGCGA TCGACACGCT GGAAGCCGCC
TGCCGCGCCG AGGATACTCC GGCCGTCCTT GCGCTGATCG GGAGCCTCGT TCCCGAATTC
GCCCATAACC CGACCGGCGA CGTGCGCGAT AGGCTCGCGG ACCCGACCGA CGCCGAAAGG
ACCATGACCC CATGA
 
Protein sequence
MVGTMASQRP DPPLPADRRR PVNRIAVNVL LDGLLAALSA PLARWLADPR GGLLHPLWFV 
AGGAITLIIS GLPFRVQQQY WRFSGVGDLL NIAGASVASA ILFALGLHVT GFPIPTPTFP
IIHALVLLTL MGGIRIIYRL SYRRTARRQA SSQVILVGSD NSADLFIRAV ERAPDPAFRV
AGIVTQGRRQ AGRRIHGIPI LGHVEGMDDI LARLDGHGGR PERLVITDAA FRGEDLAEIM
QVAERRGLTV LRAPVLTDLT PADRLELRPV AIEDLLNRPQ VGLDHDGMAA LLGGRTVLVT
GAGGSIGSEL ARQIAEYEPG RLLLLDHGEF ALWQIDVELS ERAPRLRRET ILADIRDRAR
IEQVCAQYRP ELVFHAAALK HVPMVEANPC EGVLTNVIGT RIVADAAARH GARALVMVST
DKAVNPSSLM GASKRAAEMY CQALDVEART GDESGVMRCV TVRFGNVLGS TGSVVPLFRR
QLERGGPLTV THPDMQRYFM TVSEAVGLVL QASVRGTIRA RAGSGTDTLL RSGGIFVLDM
GKPIRIVDLA RQMIRLAGLR PDQDVPIRFT GLRPGEKLFE ELFHGREAPV ATDHPGLKMA
TPRMVDRGAV GLAIDTLEAA CRAEDTPAVL ALIGSLVPEF AHNPTGDVRD RLADPTDAER
TMTP