Gene Daro_3798 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3798 
Symbol 
ID3567954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4082405 
End bp4084081 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content62% 
IMG OID637682273 
Producthelix-turn-helix, Fis-type 
Protein accessionYP_286997 
Protein GI71909410 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value0.852136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00299193 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCACCA TTCAGTACCC CGAGTCGAAT GACCTGCGCA ATCTGGTCAA GTTTTGCGAA 
AAGGATGGAA CGATCTGGCT GGCCGAGAAC CGCATGGTCC TCATGCATAC CTCGGCACTC
GGCGCCTTGC GCCGCGAACT GGTCGGCTCG GTTGGCAAGG AACACGCCCG CCGCCTGCTG
ACCCGCATGG GCTACGCCGC CGGTGTCTGC GATGCCGAAC TGGCCAAGCG CATCCGCGGC
AACCTGTCGA TGCAGGACGC CTTCTTCACC GGCCCGCAAC TGCACATGCT CGAAGGTATC
GTGCGCGTCA CGCCGGTGCA CATGAACCTC GATTTCGACA AGGGCACCTT CTCCGGTGAG
TTCCTGTGGG AAAACTCCTG GGAGCAGGAC ATCCACCTGC GCGAACTCGG GCAGTCGGAT
GAACCGGTCT GCTGGTCGCA GATCGGCTAT GCCTCGGGCT ACACCTCGGC CTTCATGGGG
CGTTTCATCC TGTTCAAGGA AATTGAATGC GTGGCCTGCG GCCACAACAA CTGCCGCATC
GTCGGCAAAC CGCTGGAAGA ATGGGAAGAC GCCGAGCAGC ACGCCAGCTA TTTCGAATCC
GATTCGATGC TCAACCATCT GCTGGAACTG CGCCACCAGG TCGACTACCT GCGCACCACC
ATTTCGCAGC AGACCCAGAC GCCCCAGCTG GTGGGCAATT CCAAGGGCTT CCAGCACGCC
TACGACCTCG TCACCCGCGC CTCGGGTACC CAGGTCACCG TGCTGCTGCT TGGTGAAACC
GGCGTCGGCA AGGAACGCTT CGCCCGCACA CTTCACCAAA TGAGCAACCG CAAGCAGGGG
CCGTTCGTCG CGGTCAATTG TGCGGCCATG CCCAATGACC TGATCGAATC CGAATTGTTC
GGCGTCGAGA AAGGCGCTTT CACCGGCGCC CATACCTCGC GCATGGGCAA GTTCGAGCGG
GCGGACGGTG GCACCCTGTT CCTCGACGAG ATCGGCGAAC TGCCGCTCGC CGCCCAGGCC
AAGCTGCTGC GCGTGCTGCA GGAAGGCGAG ATCGAACGCC TGGGCGACGA CCGGGTACGC
AAACTGAACA TCCGCCTCGT CGCCGCCACC AACGTCGACC TGCAGGCCGC CGTCAAGGCC
GGGCGTTTCC GCTCCGATCT CTTTTACCGG CTGAGTGTCT ATCCGATCCA GATCCCGCCG
CTGCGCGAAC GCATCGCCGA CATCCCGCTG CTGATCGAAG CCATGCTCGC CCGCTTCAGC
ACGCTCTATG AAAAGAAACT GCTCGGCGTC AGCGACAAGG CGATGCAGGC CATCAAGCGC
TATCAGTGGC CGGGCAACGT GCGCGAACTG GAAAACATGA TCGAGCGCGG CATGATCCTC
GCCCCCAACG GCGGCTGGAT CGAACAGGAA CACCTGTTCG CCAATGTCGC CGAAAGCGAT
TTCAGCGAAG CGCAGATCAC CCAGGCCGGC AGCCTGGAGC GAAAACAGGA ACCCGGTGCC
CCGGATAGCC TGATCGAAGC CATTTTCAGC AGCGGCATTG GTTTCGAGGA ACTGGGAAAC
CAGCTGATCA ACGAGGCCGT CAAACGCGCT GACGGCAACC TGGCCGGCGC GGCCCGCACG
CTAGGCATCA CCCGGCCGCA ACTGCAATAT CGCCTGAAAA AGAAGGACGG CGTCTGA
 
Protein sequence
MTTIQYPESN DLRNLVKFCE KDGTIWLAEN RMVLMHTSAL GALRRELVGS VGKEHARRLL 
TRMGYAAGVC DAELAKRIRG NLSMQDAFFT GPQLHMLEGI VRVTPVHMNL DFDKGTFSGE
FLWENSWEQD IHLRELGQSD EPVCWSQIGY ASGYTSAFMG RFILFKEIEC VACGHNNCRI
VGKPLEEWED AEQHASYFES DSMLNHLLEL RHQVDYLRTT ISQQTQTPQL VGNSKGFQHA
YDLVTRASGT QVTVLLLGET GVGKERFART LHQMSNRKQG PFVAVNCAAM PNDLIESELF
GVEKGAFTGA HTSRMGKFER ADGGTLFLDE IGELPLAAQA KLLRVLQEGE IERLGDDRVR
KLNIRLVAAT NVDLQAAVKA GRFRSDLFYR LSVYPIQIPP LRERIADIPL LIEAMLARFS
TLYEKKLLGV SDKAMQAIKR YQWPGNVREL ENMIERGMIL APNGGWIEQE HLFANVAESD
FSEAQITQAG SLERKQEPGA PDSLIEAIFS SGIGFEELGN QLINEAVKRA DGNLAGAART
LGITRPQLQY RLKKKDGV