Gene Daro_4023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_4023 
Symbol 
ID3567195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4321875 
End bp4322900 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID637682496 
ProductTRAP dicarboxylate transporter- DctP subunit 
Protein accessionYP_287220 
Protein GI71909633 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value0.000715692 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGAAA TCCGGCGCAA TATCCTGAAA GGCATGGTTG CCACACCGGC TTTGGCCCTG 
TCGCCCAATC TGCTGTGGGC GCAAGCGGCT GGCGAGTTGA AAATTTCGCA CCAGTTCCCT
GGTGGCACCG CGACCGAAGG CGATTTCCGC GACCGACTCT GTCGCCGCTT CAGCGCCGAA
ATTACCAAGC GAACCAATGG CGCCTTGAGG GGCACGGTTT ATCCCGGATC GTCGTTGATG
AAGACCAACG CCCAGTTCAG TTCGGTACGC AAAGGCGCGC TCGACATGAC GCTGGTGCCG
CTCTCCTATG CCGGCGGCGA AGTGCCGGAA ACCAACATCG GGCTGATGCC GGGCATCGTC
ACTTCCTACG AGCAGGCAGT GAGCTGGAAG AAAGCCGAGA TCGGCAAGGC GCTGGCCAAT
ATCCTCGCCG ACAAGGGCGT GCTGGTTGTC AGCTGGATCT GGCAGGCCGG CGGCGTCGCC
AGTCGGGTCA AGCCGATCAT CGATCCGGAA GATGCCAAGG GCCTGAAAGT CCGCGGCGGC
AGCCGCGAGA TGGACATGGT GCTGAAGCAG GCCGGCGCCA CGGTGCTGAC CCTGCCGTCG
AACGAAATCT ACGCCGCGAT GCAGACCGGC GCGCTGGATG CCGCGATGAC CTCGTCGACC
AGCCTGATTT CCTTCCGTCT CGAAGAAGTC GGCAAGGCGC TGACCACCGG CCGCGGCAAG
ACTTACTGGT TCATGTTCGA ACCCTTGCTG ATCTCCCGTG CGGTCTTCGA GAAACTGCCC
AAGGCGCAGC AGGATGCGAT CATGGCGGTT GGTGCCGAGA TGGAGGCTTA CGCGCTGGAA
GGGGCCAGGG CCGATGACCA GGCCGTGGCG GCGGTTTACC AGAAAGCCGG CGGCAAGGCT
TACGACCTGT CCGACGCCTC GGTCAAGAAA TGGCAGGCGA TTGCCCGCGA TACCGCCTGG
AAGGACTTCG CGGCCAAGAA TGAGAGCTGC GCCCGTATCC TCAAACTGGC CGAGGCCACG
CTGTGA
 
Protein sequence
MNEIRRNILK GMVATPALAL SPNLLWAQAA GELKISHQFP GGTATEGDFR DRLCRRFSAE 
ITKRTNGALR GTVYPGSSLM KTNAQFSSVR KGALDMTLVP LSYAGGEVPE TNIGLMPGIV
TSYEQAVSWK KAEIGKALAN ILADKGVLVV SWIWQAGGVA SRVKPIIDPE DAKGLKVRGG
SREMDMVLKQ AGATVLTLPS NEIYAAMQTG ALDAAMTSST SLISFRLEEV GKALTTGRGK
TYWFMFEPLL ISRAVFEKLP KAQQDAIMAV GAEMEAYALE GARADDQAVA AVYQKAGGKA
YDLSDASVKK WQAIARDTAW KDFAAKNESC ARILKLAEAT L