Gene Daro_0830 details


Gene Information

Locus tag: Daro_0830
Symbol:
ID: 3569455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 901387
End bp: 902664
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 59%
IMG OID: 637679286
Product: twin-arginine translocation pathway signal
Protein accession: YP_284056
Protein GI: 71906469
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
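
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(901387, 902664)
print(length)                     # 1278
print(protein_length_aa(length))  # 425
```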


Plasmid Coverage information

Num covering plasmid clones: 71
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0216762
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGACA AGAAACTGGC CCCGGAAGGC ACGACGACGG CCAAATCATC CCTTTCTCGT 
CGCGATTTCG TTTCAACCGC ATTGGGAGCC TCTCTGATGG CCATGGTCCC CCCCGGCGTT
CGCAGCGGTG CCTGGGCCGC TGGTTCCGAC GCCCCCGAAA AAAAGGAAGT CCGTATTGGC
TTCATCCCGC TCACCGACTG CGCCTCCGTT GTTATGGCTT CGGTCATGAA GTTCGATGAG
AAATACGGCA TCAAGATCAT TCCGACCAAG GAAGCCTCGT GGGCTTCGGT GCGTGACAAG
CTGGTCAATG GCGAGCTCGA CGCCGCTCAC GTGCTGTATG GCCTGATCTA TGGCGTGCAG
TTGGGCATCG GCGGCCCGAA GAAAGACATG AACGCATTGA TGAGCCTGAA CCACAACGGC
CAGGCCATCA CGCTGTCGAA TCAGCTCAAG GACAAGGGGG CCACTGACGG CGCTGGTCTG
GCCAAACTGG TTGCCAAGAA GGAACGTGAA TACACCTTCG CCCAGACCTT CCCGACCGGC
ACCCACGCCA TGTGGTTGTA CTATTGGCTG GCTTCGCAGG GCATCAATCC GATGAAAGAT
GTCAAGACCA TCACCGTGCC GCCACCGCAG ATGGTCGCCA ACATGCGTGT CGGCAACATG
GATGGTTTCT GTGTCGGCGA GCCTTGGAAC AACCGCGCCA TCATGGACAA CATCGGCTTC
ACGGCGACGA CGACCCAGGA CATCTGGACC GATCACCCGG AAAAGGTGCT CGGCACCACG
GCAGATTGGG TCAAGCAAAA CCCGAACACG GCCCGTGCCG TAGTCGCGGC GATTCTTGAT
GCCAGCAAGT GGATCGACGC CTCGATCGCC AACAAGCAGA AGACGGCCGA GACGATTGCC
AGCAAGTCCT ATGTCAATAC CGATACCGAA GTCATCGTCG CCCGCATGCT CGGGCGCTAT
CAGAATGGTC TGGGCAAGAG CTGGGATGAC AAGAACTGCA TGAAGTTTTT CAACGAAGGT
ACGGTCAATT TCCCGTACCT CAGCGATGGC ATGTGGTTCC TGACCCAGCA CAAGCGCTGG
GGGTTGCTGA AGAGCCACCC GGATTACCTG GCCACCGCCA AGCAGGTCAA CCGCATCGAT
ATCTACAAGC AGGGTGCGGC CGCTGCCGGT GTCGCGCTGC CGAAGAGCGA AATGCGCAGC
CACAAGCTGA TCGACGGGGT GGTCTGGGAT GGCAAGGACC CGGCCAAGTA CGCCGACAGC
TTCAAGATCA AGGCCTGA
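
The gene sequence is printed in blocks of ten bases, so the whitespace has to be stripped before any computation. The following minimal sketch shows one way to do that and to compute a GC percentage like the one reported in the GC content field; the helper names are hypothetical, and the database's own rounding rule is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the first two ten-base blocks of the gene sequence.
print(gc_percent("ATGGAAGACA AGAAACTGGC"))
```

Pasting the full gene sequence into `gc_percent` gives the value to compare against the reported 59%.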
 
Protein sequence
MEDKKLAPEG TTTAKSSLSR RDFVSTALGA SLMAMVPPGV RSGAWAAGSD APEKKEVRIG 
FIPLTDCASV VMASVMKFDE KYGIKIIPTK EASWASVRDK LVNGELDAAH VLYGLIYGVQ
LGIGGPKKDM NALMSLNHNG QAITLSNQLK DKGATDGAGL AKLVAKKERE YTFAQTFPTG
THAMWLYYWL ASQGINPMKD VKTITVPPPQ MVANMRVGNM DGFCVGEPWN NRAIMDNIGF
TATTTQDIWT DHPEKVLGTT ADWVKQNPNT ARAVVAAILD ASKWIDASIA NKQKTAETIA
SKSYVNTDTE VIVARMLGRY QNGLGKSWDD KNCMKFFNEG TVNFPYLSDG MWFLTQHKRW
GLLKSHPDYL ATAKQVNRID IYKQGAAAAG VALPKSEMRS HKLIDGVVWD GKDPAKYADS
FKIKA
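
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand instead of relying on an external library. It assumes the standard amino-acid assignments, which translation table 11 shares; table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence.
print(translate("ATGGAAGACA AGAAACTGGC CCCGGAAGGC"))  # MEDKKLAPEG
print(translate("TTCAAGATCA AGGCCTGA"))               # FKIKA
```

Both fragments match the corresponding ends of the protein sequence listed in the record.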