Gene Daro_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4021 
Symbol 
ID3567193 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4319973 
End bp4321325 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content62% 
IMG OID637682494 
ProductTRAP dicarboxylate transporter, DctM subunit 
Protein accessionYP_287218 
Protein GI71909631 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1593] TRAP-type C4-dicarboxylate transport system, large permease component 
TIGRFAM ID[TIGR00786] TRAP transporter, DctM subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0000127778 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAAT CAAGCATTGG CATTCTCTAC GGCCTGGGCA CCTTGGCCGT GATGTTCTCC 
GGCATTCCCA TCGCCTTCGC GCTGGGCTCC ATTGCCGTCC TTTTCATGCT GCTCTTCATG
CCGGCAGCGG CGACCGACAC CATTACCCAG AACGTCTTCG AGGAAATGGC CAACATCACG
CTGCTGTCGA TCCCGCTATT CATCCTGAAA GGCGCGGCCA TCGGCCGATC GCGCGCCGGC
GCCGACCTTT ACCTGGCGAT TCACGCCTGG ATGCACAAGA TTCCCGGTGG CCTCGGCATC
GCCAACGTCT TCGCCTGTGC GCTGTTTGCG GCGATGGCCG GCTCCAGCCC GGCGACCTGT
TCGGCGATTG GTGGCGCTGG CATTCCGGAA ATGCGGGCGC GCGGCTATTC ACCCGGGTTC
GCCGCCGGCA TCATCGCCGC CGGTGGCACG TTGGGGATTC TGCTGCCGCC GTCGATCACG
ATGATCCTGT TCGCGGTCGC CGCCGAGCAG TCGCTGGGCC GCCTCTTCCT CGCCGGCATC
TTCCCCGGCC TGCTGCTGGT CTTCCTGTTC GCCGGCTATG CCGTGTTCCG CTTCAAGGTC
GAATACAACT CGGCACTCGA TGTCTGGCAT TCCGGTGGCG CCAAGTCGGC CTACCTCGAC
GAAATGCGCA TGACCCGGGC CGACAAGACC CGCATGCTGC CGCGCGTGAT TCCCTTCGTG
CTGCTGCTGA TCGGCGTCAT GTTGGCGCTT TATGGCGGCT ACGCGACGCC GTCGGAAACC
GCCGGTCTTG GCGGCATCCT GGCGCTTGCC CTGATCGCCG GCATCTACGG CGTCTGGCGT
CCTTCGCAAC TGAAACCGAT CCTCAACAGC ACGTTGAAGG AGTCGACCAT GTTGATGTTC
ATCATCGGCA TGTCGCTGCT CTACTCCTAC GTGATGAGCT ATCTGCACAT CAGCCAGTCG
GCGGCCGAGT GGATCGTGGC CATGCATCTG TCGAAATGGC TGCTGCTCGC TGCCATCCTG
ATTTTTGTGG TGATTCTCGG CTTCTTCCTG CCGCCGGTGT CGATCATCCT GATGACCGCG
CCGATCATCC TGCCGCCGCT CAAGGCCGCT GGCTTCGACC TAATCTGGTT TGGCATCGTG
ATGACCGTGG TCATGGAAAT GGGCCTGATT CATCCGCCAG TCGGCCTCAA TATCTTCGTG
ATCAAGAACG TCGCGCCGGA CATTCCGCTC AAGGACATCG TCTGGGGTGT CTTCCCCTTC
GTCGGTCTGA TGTTCCTCGC CGTGATCCTG CTTTGCATCT TCCCGGGCAT TGCCACCTGG
TTGCCGGATG TGGTCATGGG CGTGGCGAGC TAA
 
Protein sequence
MAESSIGILY GLGTLAVMFS GIPIAFALGS IAVLFMLLFM PAAATDTITQ NVFEEMANIT 
LLSIPLFILK GAAIGRSRAG ADLYLAIHAW MHKIPGGLGI ANVFACALFA AMAGSSPATC
SAIGGAGIPE MRARGYSPGF AAGIIAAGGT LGILLPPSIT MILFAVAAEQ SLGRLFLAGI
FPGLLLVFLF AGYAVFRFKV EYNSALDVWH SGGAKSAYLD EMRMTRADKT RMLPRVIPFV
LLLIGVMLAL YGGYATPSET AGLGGILALA LIAGIYGVWR PSQLKPILNS TLKESTMLMF
IIGMSLLYSY VMSYLHISQS AAEWIVAMHL SKWLLLAAIL IFVVILGFFL PPVSIILMTA
PIILPPLKAA GFDLIWFGIV MTVVMEMGLI HPPVGLNIFV IKNVAPDIPL KDIVWGVFPF
VGLMFLAVIL LCIFPGIATW LPDVVMGVAS