Gene Gdia_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0471 
Symbol 
ID6973865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp516237 
End bp517433 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content62% 
IMG OID643390003 
Productintegrase family protein 
Protein accessionYP_002274882 
Protein GI209542653 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.116709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.000616461 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAGCA GCAGCGATAA AACACGAGAT CAGGCCGTGA CCAGGGCCAA GGGGGACATA 
GACCCTGCGA CTGGTAAGAC CTTGCCGGCT GGCATCTCGT GCCGCGGCCC TAAGCAGTAT
CAGGCTCGCA AGCAAGTCGA TGGCAAGCGC TACCGCAAGA CCTTCTCGAC CGCCGCGCTG
GCTCGACGGT GGCTCAACGA GACCGCCGCC AAGGTCGAAC TCGGCCAATT CAAGGACACC
AGCGCGCTCG ACAAGCAGAC CATCGGCGAC CTTGTGGACA GGTATGCCAA GGAGTGCATG
GATGGGCGTG GCGCTGACCT GACAGGGCAT ATCCCGGCCA TTCTCCGGGA CAAGGATCTG
CCTGGAGTCC GGCTGTCCAA ATTCTGCCTG GCGGATGTGC GCGGCTTCCG GGATCGGATG
ATGTCTGCCG ACTATTCCCC GGCCACGGTC GTCAAACGTC TAAACCTACT TGCCTCGGTC
ATCCAGCACG CCATCAGCGA GTGGGATACC TCCATCGTCA ACTACGCCTC CGGACGGTTC
GTGAAGCGGC CGGAAGGTGC GGACAAGAAG CGCAATAGAC GGCTCGACGA AGACAAGGAC
AAGGACGGGA AGACGGAGTT CGATAGGCTG ATCGTGGCTG TCTCGGACTC CGTCTATCCG
GACGATGTGT GGCTGGTCCG CTGGTCGATC GAGCAGGGCA CAAGACGCGG TGAGGCGATC
GGCCTGCGAT GGTGCGATGT CGATATCGAA CGCAGCTTGA TCAAGCTGGG GGGCGAGTCC
GGCAAGACCA AGACGCACAA GACCCAGGAA GAACAAGGCC CTGAAATCCG CCCGTTGACG
CCGGGAGCAA GGCGACTCCT GCTTGAGAAA CGGGACACAT ACGAGACGCC GCCGGAGCCC
GGCGACAGCG TGTTCAGCGT AGGCAAAGAG TCTACATTCA GCATGCGTTA CGGGCGGATG
GTCAAACGCA CCGGGCTCCA CAACCTGACG TTCCATGACC TGCGCCACGA AGCGACCAGC
CGCCTCGCGC GTCTGCTGCC GAACCCGCTG GACCTCAAGA GGGTCACGGG ACATCGTGAT
CTGAAGAGTC TGGACCGGTA TTATCAGCCG GTCCCGGAAT CCATCAGCAA GCAGATCGAG
GAGGCTGAGC GGCTGGCCGG CATCATCGCC GCCGAGGAAG GCGACGATGA CGAGTAA
 
Protein sequence
MASSSDKTRD QAVTRAKGDI DPATGKTLPA GISCRGPKQY QARKQVDGKR YRKTFSTAAL 
ARRWLNETAA KVELGQFKDT SALDKQTIGD LVDRYAKECM DGRGADLTGH IPAILRDKDL
PGVRLSKFCL ADVRGFRDRM MSADYSPATV VKRLNLLASV IQHAISEWDT SIVNYASGRF
VKRPEGADKK RNRRLDEDKD KDGKTEFDRL IVAVSDSVYP DDVWLVRWSI EQGTRRGEAI
GLRWCDVDIE RSLIKLGGES GKTKTHKTQE EQGPEIRPLT PGARRLLLEK RDTYETPPEP
GDSVFSVGKE STFSMRYGRM VKRTGLHNLT FHDLRHEATS RLARLLPNPL DLKRVTGHRD
LKSLDRYYQP VPESISKQIE EAERLAGIIA AEEGDDDE