Gene Gdia_0468 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0468 
Symbol 
ID6973862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp513510 
End bp514790 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content59% 
IMG OID643390000 
Producthelix-turn-helix domain protein 
Protein accessionYP_002274879 
Protein GI209542650 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2856] Predicted Zn peptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00592204 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCTC GGTGGGCATT AAAACAATTC GGCTACGCTG CACTCCGCGA GGCGGTCGAT 
GAGGGTGCGA CCGTCATTAG TGTCAGTCTT GATGAGCCTG CGCGCTCGCT GCGGGAGCAG
CGTGTGAGGC TCGGACTCAC GCAAGAGCAG GTGGCTCGGG CTAGCATGCT TACGGTCAAC
GACGTGCGGC GCGCTGAGCA GCTAGGGGCG ATCTCGCCGG TTCGTCGCCT GCAACTCCTT
GCCCAAACTC TTGGGCTCAA CGATGAGCAA CTGGGCGTCC GACCAGATGC GAGCGCCGAC
CAGCAATTAG CAGTCCGCCT GCGGACGCTC GGGAGTGCCG GACACAACGT ACGCTTCTCA
CCCTCCCTGG TGCTCAAGCT TGCGGAGGCG GCGTGGACGA TATCGCGGCA AAACTTGCTC
GCGGTCGAGC TTGGTGTTGT TCCGAATATA CTCAAGCACT TTGATGCGAG CGATGATTAT
TATGCTCCAG TATGGCGCCG GGGGTATGAT CTGGCAGAAC GAACGCGCAC CCTGCTGAAC
TTAGATCCAC TTGCCCCAAT CCCATCAGTG CGTGAGCTCA TTGATCAACT GGGGATTCCG
CTGATTCAGG CCGCAATGGG AGCGGCCTTC GCAGGCGCAA CGGTAGTTAA TGGTGACGAT
CGAGGGATCG TCATCAATAC TGAAGGTGAC AACCAGAATG TCTGGGTGAG GCGGATGACC
TTGTGTCACG AGCTTGGCCA TTTGCTTTGG GATCCACCAG CTCGTCTGCG CCGCCTTCAT
GTCGACCGTT ACGATGACTT GCGCACTGCC GAGGCCGGCG GCGGCGACGA AGTAGAGGCG
CGTGCTAATG CCTTCGCCAT TTCCTTTCTG GCACCGCGTG AGGCTGTAAT CGAAATCGTC
AAGCGTGGTG CAAGTCCGAC TGATCAGGTG ATCGAGCTCA TGAAACGCTT CGGCGTTGGC
GCAACGGCCG CGAAGTATCA CATCGCGAAT GTCTCGCGCA ACTGGGGTGC CGAGGTCGAT
ACCCGGCATG TCGCATCTAA CCAGCTTCCA CCACCTGATG ATTATTGGAC AACAAACGAG
AACTGGACGG CGGACTATTT TCCCGTCGCC GGCGTTCCTA TTAGTCGTCG TGGGCGCTTC
TCCGGGTTGG TAGCTATTGC CGCTTCCCGA GAACTGATTT CGACCGATAC GGCGGCATCA
TGGTTGCAGG CGGCACCCTC GGCTTTGGCT CAGCAGTTGG CGACAATTGC CGAACTGACT
GCTCAGGATC TTGCTGTTTA G
 
Protein sequence
MTARWALKQF GYAALREAVD EGATVISVSL DEPARSLREQ RVRLGLTQEQ VARASMLTVN 
DVRRAEQLGA ISPVRRLQLL AQTLGLNDEQ LGVRPDASAD QQLAVRLRTL GSAGHNVRFS
PSLVLKLAEA AWTISRQNLL AVELGVVPNI LKHFDASDDY YAPVWRRGYD LAERTRTLLN
LDPLAPIPSV RELIDQLGIP LIQAAMGAAF AGATVVNGDD RGIVINTEGD NQNVWVRRMT
LCHELGHLLW DPPARLRRLH VDRYDDLRTA EAGGGDEVEA RANAFAISFL APREAVIEIV
KRGASPTDQV IELMKRFGVG ATAAKYHIAN VSRNWGAEVD TRHVASNQLP PPDDYWTTNE
NWTADYFPVA GVPISRRGRF SGLVAIAASR ELISTDTAAS WLQAAPSALA QQLATIAELT
AQDLAV