Gene Gdia_0642 details


Gene Information

Locus tag: Gdia_0642
Symbol:
ID: 6974039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 729300
End bp: 730523
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 67%
IMG OID: 643390173
Product: Capsule polysaccharide biosynthesis protein
Protein accession: YP_002275049
Protein GI: 209542820
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3562] Capsule polysaccharide export protein
TIGRFAM ID:
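
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short check can illustrate. This is a minimal sketch that assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 729300
end_bp = 730523

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # prints: 1224 407
```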


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.696628
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.841432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGAGTT TCCTGGTGCT GCAGGGCAAT GCCACCCCGT TCTTTTCCGA ATTGGCCGCG 
GCCCTGAAGG CCGCCGGTCA TCACGTCCGT CGCATCGCTT TCAACGGCGG CGATGTCGTC
TTTTCCTCGG ACGCCACCTG GTTTCGCGGA CGCGAGGAGG CCTTGCCCGC CTTCATCGAA
GACATCGTGG CGCGCGAACG GCCGGATGCG ATGATCCTGT TCGGCGATTG CCGACCGATC
CATCGCGTGG CGACCGAGAT CGCCCGCGAG CGGGGCATCG CCATATGGGT GTTCGAGGAG
GGGTATCTGA GGCCGGGCTG GATTACGCTG GAGCCGCACG GCGTCAATGG ATTTTCCGCC
CTGCCGCGCG ACGCCGCCGC CATCCGGGCG CGCGGGACCG CACCCTGGCC CGCCCCGCGA
TACAAGCCGC ATGCCGATTT CCTGCGGCGG TCGGTCTATG ACGTGTCCTA TCACGCCCTG
CGCGTGGCCC TGACCCCGCT TTTTCCCCAC GCGCGGTTTC ATGCCGCCAT CGACCCGTTC
GTGGAATATG CGGGCTGGCT GCGGGACTGG GCCGGGCGCC TGGTGCGCAA GCCGCCGCAG
GCGGTACTGC CCGATGGCCC CTTCATGCTG GTGCCGATGC AGATGGAAGG GGACTACCAG
CTCCGGGTCC ATTCCCCGTT CCACGGCATG GGCCAGGCGC TGGAGCAGAT TCTGGGCTCT
TTCGCCGCCC ATGCGCCCGA TACGCTGTCG CTGGTCGTGC GGCGTCACCC GCTCGATCCC
CGGCTGACGG ACTGGGAGGG GCTGGTCCGT GATCGTGCGC AGGCACTGGG CGTCGCGGAC
CGGGTTTATT TCATGTCCGA AGGGCCGCTG GAACCGGTCC TGGATTCCTG CATCGGGGTC
GTGACGGTGA ACAGCACCGT CGGGCTGCTG GCCCTGCGGC GGAACAAGCC GGTCAAAATC
CTGGGCGAGG CGATCTACGA CGTCGAGGGC TTGACCTTTT CCGGCCCGCT GGGACGCTAC
TGGCGCGAGG CCTGCGCCCC GGATGCCGGA CTGCTGGACG CCTTCTGCCG CATGCTGATC
CAGGAGGTGC TGGTCGAGGG TGATTTCTTC ACCCCCGAAG GACGGGCGCT GGCGGTCGAG
GGATCGGTTC GGCGGATCCT GTCCGCCTAT TCCGACAGGG CCTGCAATTC CCGCACGAGA
ACCGCCGCCG TCGTTTCGAT CTGA
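
The GC content reported for this gene can in principle be recomputed from the sequence above. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the total sequence length; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Sample input: the first row of the gene sequence shown above.
first_row = "GTGCTGAGTT TCCTGGTGCT GCAGGGCAAT GCCACCCCGT TCTTTTCCGA ATTGGCCGCG"
print(f"{gc_content(first_row):.0%}")
```

Passing all rows of the gene sequence as one string gives the value for the whole gene, which can then be compared with the GC content field.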
 
Protein sequence
MLSFLVLQGN ATPFFSELAA ALKAAGHHVR RIAFNGGDVV FSSDATWFRG REEALPAFIE 
DIVARERPDA MILFGDCRPI HRVATEIARE RGIAIWVFEE GYLRPGWITL EPHGVNGFSA
LPRDAAAIRA RGTAPWPAPR YKPHADFLRR SVYDVSYHAL RVALTPLFPH ARFHAAIDPF
VEYAGWLRDW AGRLVRKPPQ AVLPDGPFML VPMQMEGDYQ LRVHSPFHGM GQALEQILGS
FAAHAPDTLS LVVRRHPLDP RLTDWEGLVR DRAQALGVAD RVYFMSEGPL EPVLDSCIGV
VTVNSTVGLL ALRRNKPVKI LGEAIYDVEG LTFSGPLGRY WREACAPDAG LLDAFCRMLI
QEVLVEGDFF TPEGRALAVE GSVRRILSAY SDRACNSRTR TAAVVSI
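
The protein sequence corresponds to a translation of the gene sequence under translation table 11, where the GTG start codon is read as methionine. The sketch below illustrates this on short fragments of the gene. It is not the database's own method: the `translate` helper is hypothetical, the codon table is written out by hand rather than taken from a library, and only ATG, GTG, and TTG are treated as start codons, which is a simplification of table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
# Table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(raw: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGCTGAGTT TCCTGGTGCT GCAGGGCAAT"))  # prints: MLSFLVLQGN
```

The output matches the first ten residues of the protein sequence. Applying the same function to the complete gene sequence would be expected to reproduce the full 407-residue protein.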