Gene Gdia_2023 details


Gene Information

Locus tag: Gdia_2023
Symbol:
ID: 6975450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2244526
End bp: 2245650
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 71%
IMG OID: 643391553
Product: hypothetical protein
Protein accession: YP_002276398
Protein GI: 209544169
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
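
The start, end, and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues for a CDS whose final codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length(2244526, 2245650)
print(length)                           # 1125
print(expected_protein_length(length))  # 374
```

Both values agree with the Gene Length (1125 bp) and Protein Length (374 aa) fields of the record.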


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.426657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCCCT CTTTCCTGCA ACGCGCTCGC CGGGTGGCGC GGTCCGGCCC GGACCGGGTC 
GGGATCGCGG CGGTCGCGGA CGGCGTCACC ATCCTGGAAA CCTGTCCGGC CCTGCCCGCC
CGCTTCGCCG GCGGCGCGAT GCGCGGGCGG GGGGCGAATC CCTTCCTGGA CTGGACGAGT
ACGCCGGTCA CGGTGACGGT TCATGACCTG CGCGACGTCG TCTACGATGT CGATCATCGG
GTGCTGATCA AGGATGGGCG CCTGATCGCC GAGACCTGCT ACCTGCAGCC GGAGGACGCG
CTGGCCCGGG TCGTCGCCGC CCGCGCGCGA CCGGACCGCC TGGCAGGCCG GGGAATAATG
GTTCCGTGCA GCGATCATTG GCCGGGCAAT TACTATCACT GGATGGCGCA TGGCCTGCCG
GTGATCGCCG CCGCGTCGGA CCTGCCGGAT GGCGGCGCGG CACGGCTGCT GCTGCCGGCG
CTTCTGCCCT GGCAGCATCG CACGCTGCAG ATGCTGCGGC CCGGAGGATG CGCGATCGAA
CGGATCATGG CGGGGCGGCA ATACCGGATC GACCGGGTCG CCTATTGCAA CATCGTCGCC
GGGGCGGCCG ACTTCGCGGT GTCACGGCTG TGCGGGCGGG TATTCGCACG CCTGGCGGCG
GCCGTTCCCG TCGTCCGGCC GCATGGCGCG CGCCTGTATG TCGATCGCGG CGGGGCCGGC
CATCGTGCCA TCCCGAACGA GGGCGCGCTG GCCGCGCGGC TGCGCGGCCT GGGGTTCCTG
GCGGTCCGGC CCGAAACCCT GACGGTGGCC GAACAGATCG ACCTGTTCCG GGCCGCGTCG
ATGGTGGTGG GGCCACTGGG CGCCGGCATG ACCAATATCG GATTCTGCCG CCCCGGGACC
GTGGTCTACG ACCTGGTCCC GGACCATCAC GCCAACCCGT GCTTCCTGGC CATGGCCATG
CGCGGCGGCC TGGAATACTG GGCGGATCTG TTCCCGACCG GGGCGGCGCG ACAGGACCAT
ATGGCCCCCT GGGGGCAGGG GATCGACGTC GAAAGGGTGG TTCGGCGGGT GGAGGAACTG
CTCAGGGGGC TTTTGCCGGG CGCAGGTGCT GCGCCCCTGC CGTGA
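
The Gene Length and GC content fields can be recomputed from the listing above once the 10-letter blocks are joined. This is a minimal sketch; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the listing is used here to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence listing; paste the full listing to check
# it against the reported 1125 bp and 71% GC.
first_line = "ATGTTTCCCT CTTTCCTGCA ACGCGCTCGC CGGGTGGCGC GGTCCGGCCC GGACCGGGTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_content(seq)))
```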
 
Protein sequence
MFPSFLQRAR RVARSGPDRV GIAAVADGVT ILETCPALPA RFAGGAMRGR GANPFLDWTS 
TPVTVTVHDL RDVVYDVDHR VLIKDGRLIA ETCYLQPEDA LARVVAARAR PDRLAGRGIM
VPCSDHWPGN YYHWMAHGLP VIAAASDLPD GGAARLLLPA LLPWQHRTLQ MLRPGGCAIE
RIMAGRQYRI DRVAYCNIVA GAADFAVSRL CGRVFARLAA AVPVVRPHGA RLYVDRGGAG
HRAIPNEGAL AARLRGLGFL AVRPETLTVA EQIDLFRAAS MVVGPLGAGM TNIGFCRPGT
VVYDLVPDHH ANPCFLAMAM RGGLEYWADL FPTGAARQDH MAPWGQGIDV ERVVRRVEEL
LRGLLPGAGA APLP
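
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on a library; NCBI translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons, which does not matter here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order follows the conventional T, C, A, G layout used by NCBI.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGTTTCCCT CTTTCCTGCA ACGCGCTCGC"))  # MFPSFLQRAR
```

Running `translate` on the full gene sequence should yield the 374-residue protein listed above, assuming the listing is complete and in frame.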