Gene TM1040_1511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_1511 
Symbol 
ID4077067 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp1617419 
End bp1618765 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content61% 
IMG OID638006824 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_613506 
Protein GI99081352 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.565949 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.673528 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAGT CAGACGTAAG CCGTCGCGGC CTGCTGAGAA CCGGTGCGGT CGCGGGTGCA 
GGGCTCGCCA TGCCGACCAT CTTTACGGCC CAAAGCGCGC ATGCATTCAC CAACAACCCC
ACTGGCGGCA CTGTTACACT CGGCTTTAAC GTCCCGCAGA CCGGCCCTTA CGCCGATGAA
GGTGCGGACG AGTTGCGCGC CTATGAGCTG GCGGTCGAGC ACCTGAACGG CGGTGGCGAT
GGCGGCATGC TCACCACCTT CAGCTCCAAG GCTCTGCAGG GCAATGGCAT CCTGGGCAAG
AAAGTCGAAT ATGTCACCGG CGATACCCAG ACCAAATCCG ATGCGGCGCG CGCTTCTGCC
AAGTCCATGA TCGAAAAAGA CGGTGCGATC ATGATCACGG GCGGCTCGTC TTCGGGTGTG
GCTGTGGCCG TGCAGGCGCT CTGCCAAGAG GCAGGCGTAA TCTTTATGGC GGGTCTTACC
CACTCCAATG ACACCACAGG CAAAGACAAG CGGGCCAATG GTTTCCGCCA CTTCTTCAAC
TCTTACATGT CTGGTGCGGC GCTGGCGCCG GTGCTGGCGA ATGCCTACGG CACCGACCGT
AAGGCCTATC ACCTGACCGC CGACTACAAC TGGGGCTATA CCACCGAAGA AGCAGTCCGG
TCCTCCACCG AAGCGATGGG CTGGGAAACC GTGGCTGCGG TGAAAACACC GCTAACCCAG
ACCGACTTCT CGTCCTATAT CGCCCCGGTT CTGCAGTCCG GTGCCGACAC GCTGGTTCTG
AACCACTACG GCGGCAACAT GGTGAACTCT CTCACCAACG CGGTGCAGTT CGGCCTGCGC
GACAAGCAGG TGAACGGCAA GGACTTCCAG ATCGTTGTTC CGCTCTACTC CCGCCTGATG
GCGAAAGGTG CGGGCGCCAA CGTGAAGGGC ATCTTCGGCT CCACCAACTG GCACTGGTCG
CTGCAGGACG AAGGTTCCAA GGCCTTTGTA CGCTCCTTCG GCACCAAATA CGGCTTCCCG
CCGAGCCAGG CCGCTCACAC CTGCTATGTG CAGACCCTGC TCTATGCAGA CGCGGTTGAA
CGCGCTGGCT CCTTTGCGCC CTGCGCCGTG GCAGAAGCGC TCGAGGACTA TGAGTTCGAC
GGTCTGGGCA ACGGCAAGAC GCTCTATCGT GGCGCCGATC ACCAGTGCTT CAAGGACGTG
CTGGTTGTGA AAGGGAAAGA GAACCCGACC TCGGAGTTCG ACCTTCTCGA AATCGTCGAA
GTCACCCCGG TTGGCCAGGT CACCTATGAC CCGAACCACC CGCAGTTCCA GGGCGGTGCG
CTCGGCACCT GCAACAACGG CGCCTAA
 
Protein sequence
MSKSDVSRRG LLRTGAVAGA GLAMPTIFTA QSAHAFTNNP TGGTVTLGFN VPQTGPYADE 
GADELRAYEL AVEHLNGGGD GGMLTTFSSK ALQGNGILGK KVEYVTGDTQ TKSDAARASA
KSMIEKDGAI MITGGSSSGV AVAVQALCQE AGVIFMAGLT HSNDTTGKDK RANGFRHFFN
SYMSGAALAP VLANAYGTDR KAYHLTADYN WGYTTEEAVR SSTEAMGWET VAAVKTPLTQ
TDFSSYIAPV LQSGADTLVL NHYGGNMVNS LTNAVQFGLR DKQVNGKDFQ IVVPLYSRLM
AKGAGANVKG IFGSTNWHWS LQDEGSKAFV RSFGTKYGFP PSQAAHTCYV QTLLYADAVE
RAGSFAPCAV AEALEDYEFD GLGNGKTLYR GADHQCFKDV LVVKGKENPT SEFDLLEIVE
VTPVGQVTYD PNHPQFQGGA LGTCNNGA