Gene TM1040_1417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1417 
Symbol 
ID4078047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1510334 
End bp1512001 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content60% 
IMG OID638006727 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_613412 
Protein GI99081258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.628959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.153951 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCA CCAAACTCAA GAGCTCGACG ACCCATCATT GGGCGGCTGA GATGCATGCG 
CAGGAACTGA AGGCGGGCAA GATCTCGCGC CGCGAGTTCC TGACCCGTGC GACAGCGCTT
GGTGTCTCGG CCACCGCCGC CTATGGGATG ATCGGCCTGA GCGCCCCTGC CCGTGCGGCG
GACAACATGA AAGAAGGCGG CACCCTGCGC GTGCAAATGA GCGTGCGCGG CCTCAAGGAT
CCCCGCACCT TCGACTGGTC CGAAATCGGC AACCAGGCGC GTGGCACGCT TGAATACCTG
GTTGAATACA ACAACGACGG CTCCTTCACC CCGATGCTCC TGGAGAGCTG GCAGGTGAAT
GACAGCGCCG ATGAATACAC CCTGAACGTT CGCAAGGGCG TCAAATGGAA CAACGGCGAC
GACTTTACCG CCGAAGATGT GGCCCGCAAC ATCGAAGGTT GGTGCGACAA GGACATGGAA
GGCAACTCCA TGGCTGGCCG CTTTGCCACG CTTGTCGATC CCGACACCGG CAAAGCCATC
GAGGGCGCGA TCGAGGTGGT AGACAGCCAC ACCGTGAAAC TGAAACTGCC TAACTCCGAC
ATTTCCATCA TTCCGGGCAT GGCGGATTAT CCGGCAGCGA TCACCCATGC GAGCTTTAGC
GCGGATGATG CGCAAAACGC CGTGGGCACC GGTCCCTATC TGATTTCCGA GCTCGAAGTC
GGCGTCAAAT GCGTGCTCAC GCGCAATGAG AACCACAGCT GGTGGGGTGA AGCGATCTAC
GGCAAGCCGG CGCTCGACAG CATCGAATAC ATCGATTTCG GCACCGATCC GTCGTCGTGG
CTGGCAGCGC TGGAATCCGA AGAAGTCGAT ATGCTCTATG AATCCGTTGG CGAATTCATC
GACTTGATGG ACTCGCTTGG CTTCGTGAAA TCCGAAGTTG TCACCATGGC AACCATCGTC
GTGCGTCCCA ACCAGCTGGC CGAGGTGGAC GGCATGTCCC CCTATGCCGA CAAACGCGTG
CGCAAGGCGC TGCAGATGGC GGTGGACAAT GATGTCTGCC TGGAGCTTGG CTATGGCAAC
CGCGGTGTTG CTGCGGAAAA CCATCACGTC GGCCCGGCGC ATCCTGAATA TGCCGAGCTC
CCTAAAGAGG CGGCTGATCC CGATGGCGCC CGCGCGCTGA TGGAAGAAGC CGGCATGATG
GATTTCGAAC ATGAGCTCTT CTCCATCGAC GATGACTGGC GCAAGAACAC CACCGATGCG
GTCGCTGCCC AGCTGCGTGA CGCGGGGTTC AAGGTCAAGC GGACCGTGCT GCCGGGATCG
ACCTTCTGGA ATGACTGGAC CAAATATCCG TTCAGCTCGA CCAACTGGAA CCACCGTCCC
TTGGGTGTTC AGATCTGGGC GCTGGCCTAC CGTTCAGGTG AGGCCTGGAA CGAATTCGGC
TGGTCCAACC CCGAGTTCGA CCAGATCCTG GCCGACGCGC TTGCCATCGC CGATGCCGAC
GCGCGCCGCG AAGTCGTCGC CAAAGGTCAG AAGATGATCC AGGACGAGGG TGTCACCATC
CAGCCCTATT GGAAATCCCT GTTCCGTCAC ATGCGCGAAG GCGTGCGCGG TGCCGATATG
CATATCTCCT ACGAGCACCA TCACTACAAA TGGGGCTGGG CTGCCTGA
 
Protein sequence
MTTTKLKSST THHWAAEMHA QELKAGKISR REFLTRATAL GVSATAAYGM IGLSAPARAA 
DNMKEGGTLR VQMSVRGLKD PRTFDWSEIG NQARGTLEYL VEYNNDGSFT PMLLESWQVN
DSADEYTLNV RKGVKWNNGD DFTAEDVARN IEGWCDKDME GNSMAGRFAT LVDPDTGKAI
EGAIEVVDSH TVKLKLPNSD ISIIPGMADY PAAITHASFS ADDAQNAVGT GPYLISELEV
GVKCVLTRNE NHSWWGEAIY GKPALDSIEY IDFGTDPSSW LAALESEEVD MLYESVGEFI
DLMDSLGFVK SEVVTMATIV VRPNQLAEVD GMSPYADKRV RKALQMAVDN DVCLELGYGN
RGVAAENHHV GPAHPEYAEL PKEAADPDGA RALMEEAGMM DFEHELFSID DDWRKNTTDA
VAAQLRDAGF KVKRTVLPGS TFWNDWTKYP FSSTNWNHRP LGVQIWALAY RSGEAWNEFG
WSNPEFDQIL ADALAIADAD ARREVVAKGQ KMIQDEGVTI QPYWKSLFRH MREGVRGADM
HISYEHHHYK WGWAA