Gene Daro_3241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3241 
Symbol 
ID3566587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3492118 
End bp3493263 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content63% 
IMG OID637681712 
Productextracellular ligand-binding receptor 
Protein accessionYP_286441 
Protein GI71908854 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value0.000702514 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTC GCCAGCTTCT CTCTGTTCTG TCTGCCGCCT GCCTGCTTGC CTTCAGCGGC 
ATCAACCATG CCCAGGCGCA AACCGTCACC ATTGGCGTCA GCGTCAGCTC GACCGGCCCG
GCCGCTTCGC TCGGCATCGC CCAGAAGAAC ACCATCGAAC TGCTGCCGAA GACGCTGGGC
GGCATGCCGG TGAACTACGT CGTGCTCGAC GACGCCTCCG ATCCGACATC GGCCGGCAAG
AATGCCCGTC GCTTTGCCGA TGCCGATCAC GTTGATGCCA TCATCGGTTC AACCACCGTG
CCGACCTCGC TGGCCGTGGC CGAAGTGGCC GGCGAAGCGA AGATTCCGCA AATCGCCCTG
GCCCCGTTCC CGCCCAAGCA GATCCAGTGG GTCTTCCCGC TGCCGCACGG GGTTGGCGTG
ATGTCCGCCG CACTGTTCGA GGAAATGAAG AAGCGTGACA CCAAGACCAT CGCCTTCATC
GGCTTTTCCG ACGCCTACGG CGAAGCCTGG CTGAAGGATG TCGAAAAGCG CGCCGAAGCC
AACGGCATCA AGCTGGTCGC CGTCGAGCGC TATGCCCGCA CCGACCAGTC GGTCACGGCC
CAGGCCCTGA AGCTGGTTGG CCAGAAGCCG GACGCCATCT TTATCGCGGC TTCCGGAACC
CCGGCCGCGC TGCCGATGCG TGCTTTGCGC GAACGTGGCT TCAAGGGCCA GTTCTATCAG
ACCCACGGCG CCGCCAATAA CGACTTCCTG CGCGTCGCCG GTAATTCCGA CGAAGGCGTG
ATCCTGCCGA CCGGGCTGGT GCTGGTGGCC GAGCAACTGC CGGACAGCAA TCCGAGCAAG
AAGGTAGCAC TCACTTACCT GAAGGCTTAT GAAGGCAAGC ACGGTGCCGG TTCGCGCAAC
CCGTTCGGCG CCTATGCCCA TGATGCCTAC CTGCTGCTCG ACAAGGCCGT GGCCGCCGTC
GGCAAGAAGG CCGTACCTGG TACGCCGGAA TTCCGCGCCG CCTTGCGCGA TGCGCTGGAA
AAGACCAACA ATGTGCCTTT GACGCACGGT AGCGACACCA TGTCGGCGAC GGATCACTCG
GGTTTGGGCG AAGGGGCGCG GGTATTGATC ACGATCGAGA AAGACAGCTG GAAGCTGTTG
AAGTAA
 
Protein sequence
MKRRQLLSVL SAACLLAFSG INHAQAQTVT IGVSVSSTGP AASLGIAQKN TIELLPKTLG 
GMPVNYVVLD DASDPTSAGK NARRFADADH VDAIIGSTTV PTSLAVAEVA GEAKIPQIAL
APFPPKQIQW VFPLPHGVGV MSAALFEEMK KRDTKTIAFI GFSDAYGEAW LKDVEKRAEA
NGIKLVAVER YARTDQSVTA QALKLVGQKP DAIFIAASGT PAALPMRALR ERGFKGQFYQ
THGAANNDFL RVAGNSDEGV ILPTGLVLVA EQLPDSNPSK KVALTYLKAY EGKHGAGSRN
PFGAYAHDAY LLLDKAVAAV GKKAVPGTPE FRAALRDALE KTNNVPLTHG SDTMSATDHS
GLGEGARVLI TIEKDSWKLL K