Gene TM1040_1820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1820 
Symbol 
ID4076966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1914075 
End bp1915658 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content62% 
IMG OID638007135 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_613815 
Protein GI99081661 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.575663 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.77509 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACT ATCTTAAACA CCTCGCACAG GAAGCCGCAT TGGGCCGCAT GGGCCGCCGC 
GAGTTCCTCG GCCGCGCTGC TGCGCTGGGT GTGACCGCAT TGGGCGCGAA CTCGCTGCTG
GCCTCTGCGG CCAAAGCGGC AGGCGCGCAA AAAGGCGGCA CCCTGCGTGT GGGTCTGCAA
GGGGGCTCCA CCACCGACAG CCTCGACCCG GCTCTGGCGA CAAACCAGGT GGCCCTCTCG
GTGATCCGTC TCTGGGGTGA GCCGCTGGTG GAACTGGGCG AAGAAGGCGG CCTCGTGGGT
GCCGTCGCCG AAAGCTTTGA AGCCTCGGCA GACGCAAAAA CCTGGACCTT CAAGCTGCGC
TCCGGCCTGA CCTTCTCCAA CGGCCAGCCG GTGACGGCGG CGGATGTTGT CGCCACCATG
GAGCGTCACT CCGGTGAGGA CACAAAATCC GGTGCGCTTG GCATCATGCG CTCCATTTCC
AACGTCAAGG CCGATGGCGA TACGGTGGTC TTTGAGCTCG AAGACGCAAA CGCCGACCTG
CCCTACCTGC TGACCGACTA CCACCTGGTG ATCCAGCCCA ACGGCGGCAA GGACGATCCG
GCGGCCGCGA TCGGCACCGG TCCTTATGTT CTGAAAGCCG TGGACATGGG CGTGCGCTTT
GTGGCCGAGA AGAACCCCAA CTACTGGGGC GATCTGGGCA ATGCGCAGAC CATCGAGTAC
GTGGTCATCA ACGACAATAC CGCCCGTGTG GCTGCACTGC AGGCCGGTCA GGTGGACATG
ATCGACCGCG TGCCGCCGCG CACCGCGAAA CTGGTGGATC GCGCGCCGAA CATCTCCGTG
CACTCCACTG CGGGTCCGGG CCACTATGTG TTCATCGCCC ATTGCGACAC CGATCCCTTT
GCCAACAACG ACGTGCGCCT CGCACTGAAA TACGGCATCA ACCGTCAGGA AATGGTCGAC
AAGATCCTGA ACGGCTTTGG CTCCGTGGGC AACGACTCGC CGATCAACGC CTCCTACCCG
CTGTTCACCC AGCTGGAGCA GCGCGAGTAC GATCCCGAGA AGGCCAAGTT CCACATGGAA
AAATCGGGCT ATGACGGTCC GATCCTGCTG CGCACCTCCG ACAACTCCTT CCCCGGAGCC
CCGGATGCGG CGGCGCTGTT CCAGCAGTCG CTGGCAGCGG CAGGCATCAA CCTCGAGATC
AAGCGTGAGC CCAACGATGG CTACTGGTCC GAGGTCTGGA ACAAGCAGCC CTTCTGCACC
TCCTACTGGG GTGGCCGTCC GACGCAGGAC TCCATGTTCT CCACCGCCTA TCTGTCGACC
GCAGACTGGA ACGACACCCG TTTCAAGAAC GAGCAGTTCG ACCAGACACT GCTGGCGGCC
CGTGGCGAGC TCGACGAAGC CAAGCGCACC CAGATGTATG CAGATATGTC GACCATGGTG
CGCGACGAAG GCGGCCTGAT CTGCCCGATG TTCAACGACT TTGTCGATGC AACGTCGGAT
CGTCTGGACG GCTGGAAAGA CGGCGTCAAA GGTCACGCTC TCATGAACGG TTACGCGCCC
CTGAAAATGT GGGTCAAAGC CTGA
 
Protein sequence
MNDYLKHLAQ EAALGRMGRR EFLGRAAALG VTALGANSLL ASAAKAAGAQ KGGTLRVGLQ 
GGSTTDSLDP ALATNQVALS VIRLWGEPLV ELGEEGGLVG AVAESFEASA DAKTWTFKLR
SGLTFSNGQP VTAADVVATM ERHSGEDTKS GALGIMRSIS NVKADGDTVV FELEDANADL
PYLLTDYHLV IQPNGGKDDP AAAIGTGPYV LKAVDMGVRF VAEKNPNYWG DLGNAQTIEY
VVINDNTARV AALQAGQVDM IDRVPPRTAK LVDRAPNISV HSTAGPGHYV FIAHCDTDPF
ANNDVRLALK YGINRQEMVD KILNGFGSVG NDSPINASYP LFTQLEQREY DPEKAKFHME
KSGYDGPILL RTSDNSFPGA PDAAALFQQS LAAAGINLEI KREPNDGYWS EVWNKQPFCT
SYWGGRPTQD SMFSTAYLST ADWNDTRFKN EQFDQTLLAA RGELDEAKRT QMYADMSTMV
RDEGGLICPM FNDFVDATSD RLDGWKDGVK GHALMNGYAP LKMWVKA