Gene Daro_3391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3391 
Symbol 
ID3567121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3643970 
End bp3644989 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content60% 
IMG OID637681863 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_286590 
Protein GI71909003 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID[TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGCA GAACCTTCAT CGCCGCCGCT GTGGCTACCA CCCTGTTGAG TGCCAGCGCA 
GTTCAGGCGC AGACTTATCG AGCTGAATAT CGTCTGTCGA CGACGCTCGG TACGGCATTT
CCCTGGGGGC AGGCCGGTGA GCGCTGGATT GAACTGGTCA AGGAAAAAAC GCAAGGCCGC
ATCGTCATCA AGCTCTACTC GGGCAACTCG CTGGTCGGTG GTGACCAGAC GCGCGAATTC
ACGGCGATTC GCCAGGGCGT GATCGACATG TCCATCGGTT CGACCATCAA CTGGTCGCCG
CAGGTCAAAG AACTGAACCT GTTTTCGCTG CCCTTCCTGA TGCCGGATTA CAAGGCGATT
GATGCTTTGA CCCAAGGCGA GGTCGGCAAG GATTTGTTTG CTGTCCTGGA GAAGCGCGAA
GTCGTGCCGC TGGCCTGGGG TGAGAACGGC TTCCGCGAAA TGTCCAACTC CAAGCGGCCG
ATCGCATCGC CGGCTGACAT GAAGGGTATG AAGTTCCGTG TCGTCGGCTC CCCTCTCTAC
AACGAAACCT TCTCGGCACT TGGTGCGAAC CCGACCCAGA TGAGCTGGGC GGATGCCCAG
CCGGCGCTGG CTTCCGGCGC TGTCGATGGC CAGGAAAATC CGCTGTCCGT GTTCGTGGCT
GCCAAGCTGC CGACGGTTGG CCAGAAATAC CTGACCTTGT GGCATTACGT GGCAGACCCG
CTGATCTTCG TCGTCAACAA GCAGGTCTGG GCCAGCTGGA CACTCGCCGA TCGCGAGGCG
GTGAAGCAGG CGGCATTGCA GGCAGGGCGT GAAAACATTG AAAAGGCGCG CAAGGGTATT
GCCGGCAATG ACAACACCGT CCTCAAGCAG ATCGAAGCGG CCGGCGTAAC TGTGACCAAT
CCGACTGCCG AGCAGCGTAA CGCCTTCGTT CAGGCTACGC GCCCGGTTTA CGACAAGTGG
TCGAAGACCA TTGGCGCAGA TCTGGTCAAG AAGGCCGAAA CCGCCATCGC CAAGCGCTGA
 
Protein sequence
MLRRTFIAAA VATTLLSASA VQAQTYRAEY RLSTTLGTAF PWGQAGERWI ELVKEKTQGR 
IVIKLYSGNS LVGGDQTREF TAIRQGVIDM SIGSTINWSP QVKELNLFSL PFLMPDYKAI
DALTQGEVGK DLFAVLEKRE VVPLAWGENG FREMSNSKRP IASPADMKGM KFRVVGSPLY
NETFSALGAN PTQMSWADAQ PALASGAVDG QENPLSVFVA AKLPTVGQKY LTLWHYVADP
LIFVVNKQVW ASWTLADREA VKQAALQAGR ENIEKARKGI AGNDNTVLKQ IEAAGVTVTN
PTAEQRNAFV QATRPVYDKW SKTIGADLVK KAETAIAKR