Gene Daro_1000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_1000 
Symbol 
ID3569325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp1092748 
End bp1094733 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content64% 
IMG OID637679459 
Producthelix-turn-helix, Fis-type 
Protein accessionYP_284226 
Protein GI71906639 
COG category[K] Transcription
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3284] Transcriptional activator of acetoin/glycerol metabolism 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACAAA TTCAAATGCT GGCCGAAGTC CATGACCAGC GCCTGCAACA GGCAAGGCAA 
CTATTCTTTG ATCAGGGCGG CTTGCCCGAG GGTCTGATCG ACCCGCTGAT TCTCCGTTCG
TGGGAGCGCT GTCGGCGCTT TGGTCTGGGC GAACTCAGCC TGACACCGGC CACCGAAGCA
ATGGATCGCG TCGCCCTGAA AACCGAACAG GATCGCAACC GTTATCTGCT GATGCAGGGC
CGGCCGATCA TGGAGCATGT CTTCGAGCAG ATTCGCGACT CGGGCAGCAT GGTCATCCTG
GCCGACGCCA ACGGCCTGCT GCTGGAAACC GTCGGCGACC CGGAATTCGT CAACCGGGCT
GATCGTGTCG CACTGTCCGC CGGCGCCTCG TGGGATGAAA ACCTGCGCGG CACCAATGCC
ATCGGCACCG CGCTTTCCGA AGAAGCCCCG GTCGCCGTCC TTGGCGGCGA ACACTTCATC
GAACACAACG GCTTCCTGAC CTGCTGCGCC AGCCCCATCT TCGGTCCGGA TGGGCGTCTG
ATCGGCGTCC TCGACATTTC CGGCGACTAC CGCAGCCATC AACGCCACAC GCTGGGCCTG
GTCCGCCTGT CCTCGGCCAT TGTCGAAAAG CGCCTGTTCG AATCGATTCA CGCCCGCGAC
ATCCTGGTCT GCTTCCATAG CCGCCCCGAC TATCTGGGCA GCCCGAAGGA AGGCATCGCC
GCCGTTTCGC CGGATGGTCA GGTACTGGCG ATCAATCGCA ACGGCACCGA GATTCTCGGC
ATCCGCCAGG TCGACGCCGT GCGCCGCGAT TTCTCCATGG TCTTCGAGAG CAACCTGTCC
GCCCTTGTCG ACCGTCTGCG CCACAACTCT CAGGGCACCT GCGAAATCAA TGTCAGCGGC
AAGGTCATCA ACGTCCAGCT GCGCGGTCAG TTGCCGCCGC TGGCCGTGGC CGGGCGTGTT
TTCGACGAGC CCCTGCCGCA ACGCGCGCCG CGCCGCGCCG AAACCGCGGC CGCACCAACG
CTGACGCTGG ACACCCTGAA CACCGGCGAC CCCCGCCTGC AGGCGGCCAT CGACCGCGCC
CGCCGCATGC TGGGTCGCGA CATCCCCATC CTGATCCAGG GCGAATCCGG CGCCGGCAAG
GAAATGTTCG CCAAGGGCTA CCACAACAGC GGCCCGCGTC GTGACCAGGC CTTCGTCGCG
CTCAACTGCG CCTCCATTCC GGAAACCCTG ATCGAATCGG AACTTTTCGG CTATCAGGGC
GGCGCCTTTA CCGGCGCCCG CAAGGAAGGC GCCCCGGGCA AGATTCAGCA GGCCCATGGC
GGCACGCTGT TCCTCGATGA AATCGGCGAC ATGCCGCTCA ACCTGCAGGC CCGCCTGCTA
CGCGTGCTGC AGGAACGCTG CGTGACACCT TTGGGCAGCA CACGTTCGAT CCAGGTCGAT
ATCTCGCTGG TCTGTGCCAC GCACCGCAAA CTACGCGAAG AAGTCGCCCG CGGCACCTTC
CGCGAAGATC TCTATTACCG CCTGAACGGC ATGAGCGTTA CCCTGCCCGC CCTGCGCGAA
CGAACCGACA TCCGTTCCAT GGTCGCCAAA CTGGCTGCTG TCGAAATCGC CGCACGTGGT
GGTCCGGTCA AGTTTTCCGA AGGCGCGCTG CAAGCTATCG AAGGGTACAG CTGGCCGGGC
AATATCCGCC AGCTGTTCAA CGTCATCCGC GTCGCCATTG CGCTACTCGA CGATGATGAA
ACCCTGATCA CCGAAAGCCA TCTGCCGGAA GAACTGTTCG AATCCTCCCC GCTCGCCGCG
ACCGCCAGCG TTCCAGCCTA CGACCCATGG GCCGCCGCAC CTCTCGAAGG CGCCAACAGC
ATGGATGCGA TCAGCCGCCA AGCTGCGATG CGAGCACTGG AAGCAGCCGG CGGCAACATT
TCGTCGGCGG CCCGCCAACT CGGCATCAGC CGCAACACGC TGTACCGCAA GCTGGGACGG
ATGTAG
 
Protein sequence
MGQIQMLAEV HDQRLQQARQ LFFDQGGLPE GLIDPLILRS WERCRRFGLG ELSLTPATEA 
MDRVALKTEQ DRNRYLLMQG RPIMEHVFEQ IRDSGSMVIL ADANGLLLET VGDPEFVNRA
DRVALSAGAS WDENLRGTNA IGTALSEEAP VAVLGGEHFI EHNGFLTCCA SPIFGPDGRL
IGVLDISGDY RSHQRHTLGL VRLSSAIVEK RLFESIHARD ILVCFHSRPD YLGSPKEGIA
AVSPDGQVLA INRNGTEILG IRQVDAVRRD FSMVFESNLS ALVDRLRHNS QGTCEINVSG
KVINVQLRGQ LPPLAVAGRV FDEPLPQRAP RRAETAAAPT LTLDTLNTGD PRLQAAIDRA
RRMLGRDIPI LIQGESGAGK EMFAKGYHNS GPRRDQAFVA LNCASIPETL IESELFGYQG
GAFTGARKEG APGKIQQAHG GTLFLDEIGD MPLNLQARLL RVLQERCVTP LGSTRSIQVD
ISLVCATHRK LREEVARGTF REDLYYRLNG MSVTLPALRE RTDIRSMVAK LAAVEIAARG
GPVKFSEGAL QAIEGYSWPG NIRQLFNVIR VAIALLDDDE TLITESHLPE ELFESSPLAA
TASVPAYDPW AAAPLEGANS MDAISRQAAM RALEAAGGNI SSAARQLGIS RNTLYRKLGR
M