Gene TM1040_3794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3794 
Symbol 
ID4074888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008042 
Strand
Start bp43437 
End bp45443 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content60% 
IMG OID638004453 
ProductTRAG protein 
Protein accessionYP_611188 
Protein GI99077929 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGCGACG CCATGGGAAA AGCGCGGATC GCAACTGGGA TCTTGCTCGT CACGCTGGTG 
ACCGGGGCCA TGGGATATAC CATCGCCTCG GCGGTTCTGA GCCACCAGGA TCTCGGCTTC
GGGGCGGAGA TCGACTTTGC TTATATCGCG CAGAACTATC TGGCGATCCT CGATCGCCGC
CCGGAGGACG CGCAACTCAT CCACCTGATC ATCGGCAGCT TCGCCGCTGC CGGCCTGATG
CTGAGCCTTG CCCTGTCGGG CTCGGCCCTC ACACGCTTTG GCCAGACCCA TTGGCAGTCT
GCGCGCGAGA TGAAGGCCAA CGGGTTTTTC GGCGCTCCCG GGACCGGGTT CATCCTTGGT
AAGCTCGGGA CACCAAAATC CCGCGCGAAA TTCGTCTGCT CAAAAGTCTT CCCGCACGCG
CTGATCGTGG CCCCCACCGG GCGCGGCAAG ACCACGGGCT TCGTCATTCC AAACTTGCTG
ACCTGGCAAG GCTCCGCCGT GACGCTCGAT GTGAAGGGGG AATGTTTCGA GGCCACGGCG
CGCCACCGCG CCGCCCAGGG CGACAAGGTC TATCGCTTTG CTCCGACGGA TTGGGAGGGC
AAGCGCACGC ATCGCTACAA CCCGCTCTTG CGCATCTATC AACTGAAAGA TCCCGCGCGC
CAACAGATGG AGTTGCAACT TCTTGCTACG CTTTTCTTAC AGAGCGACAA CGACCGGGTG
CAGGGCCTCC TCAAAGGCGG GATTGATCTC TTTGTGGCAG CAGGCCTGCT GGCGTTCCAG
CGCAAGCGTC CGACCTTGGG CGAGATCTAC CGCATCGCGG CCTCGGGCGG GAACAAGCAG
AAGGAGTATT TCGCGCGGGG CCATGAAGTT GACAACAGGG CGGCCAAGCT GATCTTCACG
CGGCTGGCGT CCACCAACAA CGATACTCTG ACGTCTTATG TTTCACTCCT GATGACCTCA
GGGCTTGATC AATGGCAGAA CCCCGCGATC GATGAGGCGA CGGCAGTGTC GGATTTCGAC
TTCCGGACAA TCCGCAAAAA GCCCTTCTCT GTTTATCTCG TAGTCCAGCC GCTGATGGTC
AAACCACTTG CCCCTCTGAT CCGGCTCTTT TTCTCCGATC TCCTTTCGGC GATGCAGGAA
AAGGACCCTG GGCCGGATGA GCCGTGGCCT GTGATGATCA TGCTCGATGA ATTCAACCGT
CTTGGCAAAA TGCCTATCGT GGTCGAAAGC ATCGAGACCC TCCGCAGCTA TAGCGGTCAT
CTGGCCGTCG TCACCCAGAC GATTCCCGCC CTCGATGAAA TCTATGGTGA GAATACCCGC
CGAGCCCTGC AGGGCAACGC AGGGGTAAAG CTCTACCTAA CCCCGTCTGA CGAAAAAACC
GTCGAGGAGC TGAGTAAGGC GGTCGGCAAG ACCACAAAGA CCGTGGTCAC GCGGTCGCAA
TCCATCGGCA AGAACCCCTT CGAGGGCCGC AGCCAATCCA CACGGACCGA AGAAAGCTCC
TTGCTTCCTG AAGATGAAGC ACGCCGCCTG CCACTCGACG AGATCGTCAT GGTCATCGAT
GCCCAAATGC CGGTCCGGGC GAAGCGAATC CAGTATTTTG ACGACCGCCT GTTCAAAACG
ATCCATGATG CACAGACGGG AGACTTGCCG TTTCCAGAGC CGGGGGGCGC GCAGGGTAAG
CTGCCACTTA CTATGCGCGC GATGCCGATG GCACCGCCAC CGGACGAGTC AAGCGGACCC
GAGGCCGACG GTAAGGCCGC ACGCCAGACT GCTGACGGCC AATCGTCTGG GCCATCTGGC
GCTGTTCCCA AGAAGACCGC GCCCATCGTT CAAGCTGTGA TCGCCGAGGC GCAGCGACAG
ATGGAAATGG ATCTTGAAGG TGCAGTTGCT GACGCTGAGG CCGCGCGCAT CGTTGATGAG
GCCCAGATGC GCTCCGCTGT CGATGGTTTG AACGACATGG AAGCTATGCT GCAGGAGGAT
CGCGGTCAAA AGCTGGTTGG TCGGTAG
 
Protein sequence
MSDAMGKARI ATGILLVTLV TGAMGYTIAS AVLSHQDLGF GAEIDFAYIA QNYLAILDRR 
PEDAQLIHLI IGSFAAAGLM LSLALSGSAL TRFGQTHWQS AREMKANGFF GAPGTGFILG
KLGTPKSRAK FVCSKVFPHA LIVAPTGRGK TTGFVIPNLL TWQGSAVTLD VKGECFEATA
RHRAAQGDKV YRFAPTDWEG KRTHRYNPLL RIYQLKDPAR QQMELQLLAT LFLQSDNDRV
QGLLKGGIDL FVAAGLLAFQ RKRPTLGEIY RIAASGGNKQ KEYFARGHEV DNRAAKLIFT
RLASTNNDTL TSYVSLLMTS GLDQWQNPAI DEATAVSDFD FRTIRKKPFS VYLVVQPLMV
KPLAPLIRLF FSDLLSAMQE KDPGPDEPWP VMIMLDEFNR LGKMPIVVES IETLRSYSGH
LAVVTQTIPA LDEIYGENTR RALQGNAGVK LYLTPSDEKT VEELSKAVGK TTKTVVTRSQ
SIGKNPFEGR SQSTRTEESS LLPEDEARRL PLDEIVMVID AQMPVRAKRI QYFDDRLFKT
IHDAQTGDLP FPEPGGAQGK LPLTMRAMPM APPPDESSGP EADGKAARQT ADGQSSGPSG
AVPKKTAPIV QAVIAEAQRQ MEMDLEGAVA DAEAARIVDE AQMRSAVDGL NDMEAMLQED
RGQKLVGR