Gene Gdia_3378 details


Gene Information

Locus tag: Gdia_3378
Symbol:
ID: 6976824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3700914
End bp: 3703805
Gene Length: 2892 bp
Protein Length: 963 aa
Translation table: 11
GC content: 75%
IMG OID: 643392894
Product: hypothetical protein
Protein accession: YP_002277719
Protein GI: 209545490
COG category:
COG ID:
TIGRFAM ID: [TIGR02302] conserved hypothetical protein TIGR02302
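
The length fields above are consistent with one another if Start bp and End bp are read as 1-based, inclusive coordinates. The record does not state this convention, so it is an assumption here. A minimal sketch of the arithmetic, with illustrative variable names:

```python
# Coordinates copied from the record; treated as 1-based and inclusive
# (an assumption, the record does not state the convention).
start_bp = 3700914
end_bp = 3703805

# Inclusive coordinates: both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is a stop
# codon and contributes no amino acid to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 2892
print(protein_length)   # 963
```

Both values match the Gene Length and Protein Length fields of the record.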


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.344689
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.143577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGGCG GGACCGCTCC CCCCCAGGAT GTCGCGGATA CGGGGGCGGC GCTGTCGTCG 
TTTCCGGTCC GGCTGGCGGC CGCGCGGTAT CGCGCGCGGC AGGTCCTGTG GGTCGAGGGC
GCCTGGCCGG TCCTGCTGCC GGTGCTGGGC GGACTTGCCG CCTATCTGAT CGCGGGCCTG
CTGGGCCTGC CGCAGGACCT GCCCGATCTG GCGCATGCGG CGCTTCTGCT GGCGCTGGCC
GGTGGCGCGG CCGCGTGGAT CGTCTGGCGC GGGCGCCGCG TGACCGCGCC CACCCCCGCC
GGAGTGGACC GGCGGATCGA ACGCGCGTCG GGTCTGGCGC ACCGCCCGTT GCAGACCCTG
GGCGACCACC CCGCCGGTGC CGATGCCGCT CCCCATGATG CCGCCGCCCG GGTCGAACGG
GTCGCCCTGT GGGACGCGCA TCTGCGGCGG ACGGGGCGGG CGATCGGCCG GCTGCGGGCC
GGCTGGCCGC GCCTGTCGCT GGCGGCGCAT GATCCGTGGC GCGTGGGCTA TGTGCTGCTG
CCGGGATTGC TGGCCGCACT GCTGTGGGCG GGCGGCGACG CCCGCGGCCG GCTGGAAGCC
GCCTTCTGGC CTGGCCTGGA CGATCCGGGG GCGCCGCGCC CGCATATCCA GGCCTGGATC
ACGCCGCCAT CCTACGCCCC CGGGGCCCCG GTCTTCCTGG ATGATCGCAC GGGCCAGGCC
ACCGTGCCCC AGGGGGCGGT GCTGAGCATC AGCGTGACCG ACCTGCGCGG CCGTCCGTCG
CTGCGCGTCG CCGCCACCGG ATCGGCATCG GGCCCGGCAG TGGGGCCCGA CCGGTTTCGC
GCCCTGGGGG CGGAAAGCTG GTCGGCGGAC GTTCCGCTGC TGGCCAGCGC GTCGCTGACC
CTGCGCGGAC GGGGGCGGTC GTTCAGTCGG TGGACCGTGA CTGTCCTGCC CGATGCGGCG
CCGACGGTCG CATGGGGGGC GGGTGCCGGC GCCGGACGGG GCGAATGGCG CACGCGGCTG
CCCTATGCCG CGCGGCAGGC CTACGGCATC ACGTCGCTGC GGGCCGAGCT GCGTCTGGCC
GGCGGCGAAA AGGGGGGAGC CCCGCGCGTG CTGACCGTGC CGATTCCGAT CGACGGCCAC
CCGAAGGACG TCACGGGCAT CGCCATGCCC GACCTGTCCG CCGACCCGTG GGCGGGCGAG
GAGGTCGTGG GCCGGCTGGT GGCAACCAGC GCCAGCGGCC ATGAGGGCGT CAGCCCCGAA
GCCCGGTTCC ACCTGGGCGC GCGCCTGTTC CGCAGCCCGA TGGCCAAGGC GGTGCTGGAT
GTCCGGCGAC GGGTCGCGAC GGGCCGCGAG CGCCGCACCG CCGCCGCGAG CGACCTGATG
GCGCTGGGCG AAACGCCCGA TCCGTTCCAG AACGACGCCG GCCTGCTGCT GAACCTGACC
AGCGCCGCCG CCCTGCTGGA AAGCCCGGAT GTCGATCCGC ACGCGGCGGT GGACCAGGCG
GTGGCGCGGC TGTGGTACCT GGCGCTGGAG ATCGAGGACG GCCGGCAGGG CGGAAGCGCC
GCGGCGCGAG CGGCCCTGGA TGTCCGCGCG GCGCAGGACG CCGTGGCGGC GCAGTTGAAC
CGCATGCGCG CGCTGGGGGC GCAGGGCCAG TCGCCCGAGG AACAGGCCGA ACTGCAGCGC
CGGATGGAGA CGCTGCGCCA GGCGATCATG CGCCGGATGC AGGCGCTGGC GCAGCAGGCC
GTGCAGTCGC ACACCGCCAT ACCCGACCTG CAGGGCCTGA CCCGCAACGG CGACCAGGCC
CTGTCGCGCA TGATGCAGCA GATGCAGGAC GCCGCCCGCA ACGGCCGCTC GGCCGAGGCG
ATGCAGGCGT TGCAGCGCAT GGAAGACATG CTGGAGCACA TGCGCTCCGC CACGCCGCAG
GACCTGGCCG ACATGGCGCG CCAGATGCAG GCCCGCCAGC AGGCCAACGA ACAGCGCGAC
GCGCTGCAGG ACCTGATCCG CCGGCAATCC GGCCTGCTGG ACCACAGCCA GTCTCGCCTG
GACCGCGTCC GCCACGCCCA GGAACGCGCC GAGGCCGCGC GCCGCGCCGC CCAGGGCGAG
ATGCCGGGAC AGGGAATGGA TGGCGACCTG GCCAGCATGC CCACCGCCGA ACTGCTGCGC
CGGCTGGGGC TGCGCCCCCC GCCGGACATG CAGGGGCCGC CGGCCGACGA GCCGCAGCCC
GGAGACGCCG GGCCGGCGCA GGGGCCCGAC GCGCCGCCGT CGGGGGATGC GTCCGCACCC
GGCGCGCCCA ATCCGCCGAA CGAGGAGGTC CGGCACGCCG ACCGCGCCGT GCAGCACGCG
CTGGGGCGGG CGCTGGACGA ACTGGGGCAG GAATTCAAGG GCCTGACCGG AAAGGACGCG
CCGTCCGGCT TCGCCGATGC CGGGGGCGCG ATGAAGGACG CGCGCGCCGC CCTGGCCCAG
GGCAACGACA CCGCCGCCGC CGAGGCCCAG CGCAAGGCCC TGGCCGACCT GCAGAAGGGC
GACCAGCAGA TGCGCCAGGC GATGAAGGGC TCGGGCAAGG GCGGCGCGAC CAGCTTCCTG
CCCGGCTTCG CCAGCGGATC GGGCGAGGGC GGCCAGGGCG AACCGGGCGA TAGCGGCGAC
AGTGCCCAGG GAAGTGATCA GGGAAGTGAC CAGGGAGGCG ACCAGGCGGA CGACCAGCAC
GGCGACCGGG ACCCGCTGGG CCGCCGGACC GGCGAGGGCA AGGACGGGCT GGATTCCGAC
ACCCACGTGC CGGATACGAT GTCGCGCGAA CGCGCCCGGG AGATCGAGCA GGAACTCCGC
CGCCGCGACT CCGACCGCAC CCGCCCGCGC GAGGAACTGG ATTACCTGGA CCGGCTGCTG
AAATCCTTCT GA
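
The gene sequence is displayed in blocks of 10 bases separated by spaces, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction. It uses only the first displayed line of the sequence as input, and the helper names `clean_sequence` and `gc_content` are hypothetical, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence (60 bases).
first_line = "ATGACGGGCG GGACCGCTCC CCCCCAGGAT GTCGCGGATA CGGGGGCGGC GCTGTCGTCG"

seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(round(gc_content(seq), 2))   # 0.75
```

Running the same functions over all lines of the gene sequence would give the whole-gene figure to compare with the GC content field of the record.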
 
Protein sequence
MTGGTAPPQD VADTGAALSS FPVRLAAARY RARQVLWVEG AWPVLLPVLG GLAAYLIAGL 
LGLPQDLPDL AHAALLLALA GGAAAWIVWR GRRVTAPTPA GVDRRIERAS GLAHRPLQTL
GDHPAGADAA PHDAAARVER VALWDAHLRR TGRAIGRLRA GWPRLSLAAH DPWRVGYVLL
PGLLAALLWA GGDARGRLEA AFWPGLDDPG APRPHIQAWI TPPSYAPGAP VFLDDRTGQA
TVPQGAVLSI SVTDLRGRPS LRVAATGSAS GPAVGPDRFR ALGAESWSAD VPLLASASLT
LRGRGRSFSR WTVTVLPDAA PTVAWGAGAG AGRGEWRTRL PYAARQAYGI TSLRAELRLA
GGEKGGAPRV LTVPIPIDGH PKDVTGIAMP DLSADPWAGE EVVGRLVATS ASGHEGVSPE
ARFHLGARLF RSPMAKAVLD VRRRVATGRE RRTAAASDLM ALGETPDPFQ NDAGLLLNLT
SAAALLESPD VDPHAAVDQA VARLWYLALE IEDGRQGGSA AARAALDVRA AQDAVAAQLN
RMRALGAQGQ SPEEQAELQR RMETLRQAIM RRMQALAQQA VQSHTAIPDL QGLTRNGDQA
LSRMMQQMQD AARNGRSAEA MQALQRMEDM LEHMRSATPQ DLADMARQMQ ARQQANEQRD
ALQDLIRRQS GLLDHSQSRL DRVRHAQERA EAARRAAQGE MPGQGMDGDL ASMPTAELLR
RLGLRPPPDM QGPPADEPQP GDAGPAQGPD APPSGDASAP GAPNPPNEEV RHADRAVQHA
LGRALDELGQ EFKGLTGKDA PSGFADAGGA MKDARAALAQ GNDTAAAEAQ RKALADLQKG
DQQMRQAMKG SGKGGATSFL PGFASGSGEG GQGEPGDSGD SAQGSDQGSD QGGDQADDQH
GDRDPLGRRT GEGKDGLDSD THVPDTMSRE RAREIEQELR RRDSDRTRPR EELDYLDRLL
KSF
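
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below checks this on the first 30 and last 12 bases only. It builds the codon table from the standard amino acid assignments, which translation table 11 shares (table 11 differs from the standard code in its permitted start codons, not in these assignments). The function name `translate` and the variable names are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_codons = "ATGACGGGCG GGACCGCTCC CCCCCAGGAT"  # first 30 bases of the gene
last_codons = "AAATCCTTCT GA"                      # last 12 bases of the gene

print(translate(first_codons))  # MTGGTAPPQD
print(translate(last_codons))   # KSF
```

The two outputs match the first ten and last three residues of the protein sequence above; the final TGA codon is the stop codon and is not represented in the protein.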