Gene Gdia_3229 details

Gene Information

Locus tag: Gdia_3229
Symbol:
ID: 6976668
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3537497
End bp: 3538477
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 59%
IMG OID: 643392740
Product: transcriptional regulator, AraC family
Protein accession: YP_002277572
Protein GI: 209545343
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
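
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3537497
end_bp = 3538477

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 981 326
```

Both values agree with the Gene Length (981 bp) and Protein Length (326 aa) fields.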


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.382079
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.686696
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCCACG GCACTCATTC CGCCCGGAAG GCGCGCGCCG CGACACCCGT CTTTGAAGTC 
GTGCGTCCCG AACCGCGCAA CAGCTTCGTC TGGCATACGC ATGATTATCC GGCGCCCTGC
GCGCGGTGGA ACTATCACCC CGAATATGAA CTGCACCTGA TCACCAAGGG GTGCGGCCAG
TATATGGTTG GCGATTATTT CGGCTTTTTC GCGCCGGGAA ACCTCGTGCT GATCGGGCCG
AACGTGCCGC ACGGCTGGTT CAGCGACGTA ACACCCGGCG AAAGCGTACC GGACCGCAAT
ATCGTTCTTC AGGTCAACAA GTCCTGGTTC ACGGGCCTGA TGACGCTTTG TCCGGAACTT
GACGTTCTGC ACGGGCTTCT CGCGGCGTCA GGACGTGGGG TCGAATTCCT CGGCCCGGGG
GTCGCTGCCC TTGGCGCGCG CCTGGCCGGG CTCGGCACGA TGGACGATGC CGCGCGCATT
CCCGCCATCA TCGGCCTGTT GCTCGACCTG GCCCAGTGCT CTTACCGGAC GTTATGCAGC
GCGGGCCTGA CCCTGTCCCC GGATGACAGG GAACTGGAGA AAATAGATTC CATCATCAGG
AATATAATCG ACGACAACAT CGTTTTTCGT CAGCAGGCCG AAATTGCAAA AGCGGTCGGA
TTGTCCGCAC CCGCGTTTTC GCGGCAGTTC CGGCGCGCCA CCGGCGATAC GTTCGTATCC
TTCATGAAAA AGCTGCGGAT CGGCAAGGCA TGTCAATTGC TGATGACAAC AGACGCCTCG
ATTGCCGACA TCAGCGCGGC CACGGGGTTC GGCAACCTGT CCAATTTCAA CAGGCAGTTT
CTCCAGATTC GCCAGACGAC CCCTTCGCAA TACAGGCGGG ACGTCCGACG CCTGGTGAAA
CAGGATGCTG AAACAGCAAA AATTCGAAAC GACCGCCCAT ACGGAATGCA CAGTCACCAC
GCACAGCGAT ACGTACAATA G
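
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to compute a GC percentage from such a block-formatted string. The helper name `gc_percent` is hypothetical, and the record does not state how its own GC content figure was calculated or rounded, so this is an illustration rather than the database's method.

```python
def gc_percent(grouped_sequence: str) -> float:
    """Return the GC percentage of a sequence printed in whitespace-separated blocks."""
    # Remove the spaces and line breaks that exist only for display.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    # Count guanine and cytosine bases.
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First display line of the gene sequence, copied from the record.
first_line = "ATGGTCCACG GCACTCATTC CGCCCGGAAG GCGCGCGCCG CGACACCCGT CTTTGAAGTC"
print(round(gc_percent(first_line), 1))
```

Passing the full sequence, with its line breaks left in place, works the same way because `str.split()` discards all whitespace.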
 
Protein sequence
MVHGTHSARK ARAATPVFEV VRPEPRNSFV WHTHDYPAPC ARWNYHPEYE LHLITKGCGQ 
YMVGDYFGFF APGNLVLIGP NVPHGWFSDV TPGESVPDRN IVLQVNKSWF TGLMTLCPEL
DVLHGLLAAS GRGVEFLGPG VAALGARLAG LGTMDDAARI PAIIGLLLDL AQCSYRTLCS
AGLTLSPDDR ELEKIDSIIR NIIDDNIVFR QQAEIAKAVG LSAPAFSRQF RRATGDTFVS
FMKKLRIGKA CQLLMTTDAS IADISAATGF GNLSNFNRQF LQIRQTTPSQ YRRDVRRLVK
QDAETAKIRN DRPYGMHSHH AQRYVQ