Gene Bind_2118 details

Gene Information

Locus tag: Bind_2118
Symbol:
ID: 6201042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2417681
End bp: 2418781
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 59%
IMG OID: 641706104
Product: AraC family transcriptional regulator
Protein accession: YP_001833227
Protein GI: 182679081
COG category: [F] Nucleotide transport and metabolism
              [L] Replication, recombination and repair
COG ID: [COG0350] Methylated DNA-protein cysteine methyltransferase
        [COG2169] Adenosine deaminase
TIGRFAM ID: [TIGR00589] O-6-methylguanine DNA methyltransferase
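
The start, end, gene length and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2417681
END_BP = 2418781


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1101 366
```

Under these assumptions the result agrees with the recorded 1101 bp and 366 aa.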


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0669009
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTCAGC ATGCTCTTCC AAGGGGTACA CAGCCCGCCG TCCCGATCGA GGATGATCCG 
CGTTGGATAG CCGTTGTCGA TCGCGATTCT CGTTTTGACG GCACCTTCGT CTATTCCGTG
AAAACGACAG GTATTTATTG CCGGCCGTCC TGCCCCTCAC GTCTGGCGAA GCCAGGCAAT
ATCCGTTTTC ATGCTCATTG CGCGGCCGCG GAAAAAGCAG GGTTTCGGCC GTGCCTCAGG
TGCCGTCCAC ATGAGGCGTC GCTCGCGCAC GATCATGCCG ATCTGATCGT TGCGGCCTGC
CGCCGGATTG AAACGGCGCA GAAGCAACTG AGCCTGGATC AATTGGCGAA GGCCGCCGGC
CTGAGCCCGT TTCATTTTCA CCGATTGTTC AAATCGATCA CCGGCCTGAG CCCGAAAGCC
TATGGCGCTG CCCATCGCAT GAACAAGATC CACAAAGCGC TCAGTGCGGG TGAGGAAAGT
GTGACGGCAA CGATCTACGC CTCTGGATAT CAGTCATCGA GCCGTTTCTA TGCCACCTCA
AAGGAGATGT TGGGGATGAC CGCGACTGCA TTCCGAGAAG GAGGCAACTT GGCCGAAATC
CGCTTCGCGC TAGGTGAAAC CTCGCTCGGC TCCCTTCTTG TGGCCTGTAG CGCGCAAGGC
GTTTGTGCCA TTTTCCTTGG CGATGATCCG GAGGCGCTCG TCCATGAATT GCAGGGGCGG
TTTCCGAAAG CGCATCTCAT CGGCGGTGAC GAAGGTTTCG AGGCTCTGGT CGCCAAGGTC
GTCGGTTTCG TCGAAACGCC CGCACGCGGC CTCGATCTGC CGCTGGATAT TCGTGGTACG
GCGTTTCAGC ACAAGGTCTG GCAGGCCTTG TGTGAAGTTC CTTTCGGAGA GACTGCCAGC
TACACTGACA TCGCCCGGCG GATTGGCGCG CCCAAGGCCG TCCGCGCCGT GGCGCAAGCC
TGCGCTGCTA ACAAGATCGC CGTGGCTATC CCCTGTCATC GTATTTTGCG TACTGACGGA
AGCTTGTCTG GCTATCGCTG GGGCGTCGAG CGCAAGCGCG CCTTGCTTTT GAAGGAGGGC
GCTACGAAGA ATTCGAACTG A
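
The gene sequence is shown in space-separated blocks of ten bases, so it has to be joined before any computation. The following is a minimal sketch of how a GC fraction such as the "GC content" field could be recomputed; the helper names are hypothetical, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCGTCAGC ATGCTCTTCC AAGGGGTACA CAGCCCGCCG TCCCGATCGA GGATGATCCG"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.1f}%")
```

Passing the full gene sequence to the same two functions gives the value to compare against the reported GC content.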
 
Protein sequence
MRQHALPRGT QPAVPIEDDP RWIAVVDRDS RFDGTFVYSV KTTGIYCRPS CPSRLAKPGN 
IRFHAHCAAA EKAGFRPCLR CRPHEASLAH DHADLIVAAC RRIETAQKQL SLDQLAKAAG
LSPFHFHRLF KSITGLSPKA YGAAHRMNKI HKALSAGEES VTATIYASGY QSSSRFYATS
KEMLGMTATA FREGGNLAEI RFALGETSLG SLLVACSAQG VCAIFLGDDP EALVHELQGR
FPKAHLIGGD EGFEALVAKV VGFVETPARG LDLPLDIRGT AFQHKVWQAL CEVPFGETAS
YTDIARRIGA PKAVRAVAQA CAANKIAVAI PCHRILRTDG SLSGYRWGVE RKRALLLKEG
ATKNSN
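
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the standard base order TCAG. It assumes translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as starts) and that the gene begins with ATG, as it does here. The names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 21 bases of the gene sequence, copied from the record.
print(translate("ATGCGTCAGC ATGCTCTTCC AAGGGGTACA"))  # prints: MRQHALPRGT
print(translate("GCTACGAAGA ATTCGAACTG A"))           # prints: ATKNSN
```

Both fragments match the start and end of the protein sequence listed above.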