Gene Gdia_0462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0462 
Symbol 
ID6973856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp506986 
End bp508794 
Gene Length1809 bp 
Protein Length602 aa 
Translation table11 
GC content59% 
IMG OID643389994 
Producthypothetical protein 
Protein accessionYP_002274873 
Protein GI209542644 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00687245 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.107968 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCGTC CATTCGCATC CTTCACCGAG TTGTCCGCGC CCCTCGTCGG CCGCAAGACC 
GGCATCAACC ATGATCTGCA AGACCGGCTT GCAGTCATGG ACGCCCGAGC GCTTCGTATC
ATGGAACTTG CTGGAATTAG GACGCACACA CACAATGGCC GCGACATCTA CAGGATCAGT
GAAGTCACGC TGATCGTCGA GAAAATGCCG CAAGAACGTC TGCGCCGTAC GCTCCTTATA
CCTTCTCAGG CTTACGGAAT ACGGCGCGAG CGGGGCGATG AATTACGCTT CTACGCCGAG
AACCACATGA AGACGCCGCT TTGTTCTGGC ACGTTCACTT GCTCCGAGAT GACCCCCATT
GGGGGCGATC TGGTCGGAGA CATGAAGCGT CTGCGCGGGC GGATAGGTGA CATGTTGAGA
TGGTGCCGAA GAGGGCCGTT CTCTGGTTAT GGTTTCGATG TTTTGATGCT GGCGAACGAT
TTCGCCTTCC AGTTTCATCC GCTGACGGGT GTATCCGTCA ACGGTCACGT TAATTACTGC
TATACGATGA AATCACCGAT GCCACCTGAA GTCTGGGAAG ACTTCCGTCG TGCCGTTCGC
GAGAAGTTTG ATCTTGGGTT TTATGTCACT GGTCCTGTGA CTGACCTCTC GGCGCTGCTC
GACTACATGA CGAAGTTATA CAAGGCCGAG CGAACCCGGA CCGACGAAAT CTTATTCTCG
GACCTGTCTG ATGATGTCGC CGCCTGGTTT CTAGGACAGG TCGATCAGAT GTGGAACTTG
ACCCCGCTGG CTGGCTTTAA GGTGTTCCGT GCCAGCCTCA AGACGGACAA GGAGAAGGTA
GTCCGGAAGC GCGCCGTTCC GGTCCGGAAG CCGGTCAGGA CCACGGATGG CCGGTCCGTC
GATGCGCTAG TGCCGCCGCG TCCTCGCGAA CACAGCCTCG TCGTAGCCAA GAGGCTGTCG
CCTCGGATAT TGGCCGGTGC TGATGATTCG CCGTTCCCGC CTGGATTCAC CGAGGGACCG
CCGAGGGTTC GTGTTCAGGA TCAGCATGTT AGCCACCCTG TGTATGCTAA TTACGCCATC
CATTCCGATA TCGCATCGAA CCGTGAAGCT GCCGCCGCCT CGCCGCTGGG AGCGACGCCC
CCCTCCGGGG GGCTAGCATC GGGGGCCTGT CCGTCGCCTG GCAATTCTGT GTTGGATCAA
GGACTTGACG GGTCTGTGTA TGCTAATTCC GCCAACGGTT CCGATTTCGC AACGAAGAAG
TCACGCGGCC ATGTCGAGAA CCTTGTGATC GCCCGCGAGC GGTCATCGCC GACCGAGGCC
GGCATCTGGG AAACCTGGAC CCAGGTCATG AACCTGACGT TGAGGCCAGA GACAGAGGAA
GGTGAGCGAG GTTTGGCGAT CCTCCTGAGG CACCACGAGC AGGCGACGGC GCAGGCTCGG
CGGAATGGCT GGACCGGGCG TGTGGGTAAT CCCCTAGCGC GGTTGGCGGC CGGTCTCGTC
GCCGCCGATT CCGCCGAGCC CCATATTCCT GATGATCAGA GCCTTTTTAT GCACGCTAAT
TCCGCAACTC AAAACGATAT CAAAACGATT CCTGAGACCA TACCGACATC GCCCCCGTCG
CTTCCTGACG CTCCGCCATC GTCAGCATAC AGTCACCCCT CTGCCGAGGA CGTGGCTGCG
GCCCGTTCCG TCCTGCCTCC TGAGGTCCTG TCTCGCGGCC TTCCCGACGT GTATCTTGTC
ATGTTGGCAA ACGGCCAAAG AAGAGAAGCT GAAATAGCCA AGCTGCCGTC AATGATGGGG
CGGCATTAG
 
Protein sequence
MARPFASFTE LSAPLVGRKT GINHDLQDRL AVMDARALRI MELAGIRTHT HNGRDIYRIS 
EVTLIVEKMP QERLRRTLLI PSQAYGIRRE RGDELRFYAE NHMKTPLCSG TFTCSEMTPI
GGDLVGDMKR LRGRIGDMLR WCRRGPFSGY GFDVLMLAND FAFQFHPLTG VSVNGHVNYC
YTMKSPMPPE VWEDFRRAVR EKFDLGFYVT GPVTDLSALL DYMTKLYKAE RTRTDEILFS
DLSDDVAAWF LGQVDQMWNL TPLAGFKVFR ASLKTDKEKV VRKRAVPVRK PVRTTDGRSV
DALVPPRPRE HSLVVAKRLS PRILAGADDS PFPPGFTEGP PRVRVQDQHV SHPVYANYAI
HSDIASNREA AAASPLGATP PSGGLASGAC PSPGNSVLDQ GLDGSVYANS ANGSDFATKK
SRGHVENLVI ARERSSPTEA GIWETWTQVM NLTLRPETEE GERGLAILLR HHEQATAQAR
RNGWTGRVGN PLARLAAGLV AADSAEPHIP DDQSLFMHAN SATQNDIKTI PETIPTSPPS
LPDAPPSSAY SHPSAEDVAA ARSVLPPEVL SRGLPDVYLV MLANGQRREA EIAKLPSMMG
RH