Gene TM1040_3744 details


Gene Information

Locus tag: TM1040_3744
Symbol:
ID: 4075451
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 801317
End bp: 803035
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 61%
IMG OID: 638005264
Product: tetratricopeptide TPR_2
Protein accession: YP_611973
Protein GI: 99078715
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID:
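
The coordinate and length fields in this record are related by simple arithmetic, which can serve as a consistency check. The Python sketch below assumes that Start bp and End bp are inclusive 1-based coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(801317, 803035)
print(length)                     # 1719
print(protein_length_aa(length))  # 572
```

Both results agree with the Gene Length and Protein Length fields listed for this record.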


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGTAT CCTATCTCCG TTCTTTGACC TGCGCAGCCG CGCTTTTGCT GACCACCGGG 
CCGATGGCGG TGGCGGATGG GCTGGCGGGC GCGTATCTCG CTGGACGGGC CGCGACCTAT
GAAAGCGACT TTGCCGCCTC CGCCAAATAT TACACGCGTG CACTGGTGCG CGATCCGCAG
AACATCACCC TGATGGAGAA CCTCGTTTAT GCGCAGCTTG CGCTTGGAGA GGTCGAATCC
GCGTTGCCGG TGGCAGAGCG GATGTGGCAG GCAGGTGTGA GCAGTCAGGT CGCCAATATC
GTCATGGCAG GCAATCTTGC ACTGCAGGAA AACTATGACG CCCTGCTTGC ACGCGATTCC
GAACAGTTTG AAATCAGCCC GCTGGTCGAC GGGCTGCTGG ATGCTTGGGC CTATATGGGC
AAAGGCGCGG TCTCGCAGGC ACTCGACCAG TTCGACGCCG TGGCACAACA GGACGGGCTG
CGCTTTTTTG CTTTGTACCA CAAGGCCTTG GCGCTTGCAT CGGTCGGAGA TTACGAGGGC
GCAGACCAAC TGTTTGCCGC CAATGAGGGC CAGCTAGGCA GGTCCTCGCG CCGGGCTGCA
ATCGCGCGGA TACAGGTCCT GTCACAACTC GGGCGCAACG ATCAGGCGCT TGAGGTGCTG
GTCGACAGCT TTGGCGAAGG CTTCGACCCT GCGCTTACGG AGTTTGCAGA TCAACTCGCC
ATGGGAGAGA CCTTACGGTT CTCGATTACC CCGACCGCAC GCGATGGCAT GGCAGAGGTG
TTCTACAGCC TTGGTCAGGC GCTTTCGGGC GAGGCAGCGA GTGACTATGT GCTGATGTAT
GCACGCATGG CCGCAAAACT CAGCCCCGGC CATGTGGATG CCGTGCTTCT GAGCGCGGGG
CTTCTGGATC AAATGGGTCG TTACGAACTG TCGATCGCCA CCTACAAGCA GGTGCCGCGT
GATCACCCTG ATTTCCATGC TGCCGAGCTT GGTCGTGCCG AGGCGCTGCG GCGATCGGCC
AATCCGCAGG CCGCCGCCGA AGTGCTGGAA CAACTGGCGC GCGATTTCCC GCAGCATGTC
GCGGTCTATA TCGATCTGGG CGACCTTATG CGGCAGCAGG AAAACTATGC CGAAGCCGCA
AAGGCCTATA CCCGCGCGCT CGAACTGAGC CCCGATGAGA CAACGAACCG CTGGTTCCTT
TATTATGCGC GCGGCATCTG TAACGAACGT CTGAAGAACT GGGAGGCGGC CGAGGCGGAT
TTTCGCGCCG CGCTTGAGAT CGACCCGGAC CAGCCCCAGG TTCTGAACTA CCTAGGCTAC
TCCCTTGTGG AGCGGCAGGA AAAACTGGAC GAGGCACTTT CCATGATCGA GCGTGCCGTC
GCTGCGCGCC CCGAGAGTGG CTATATCATC GACAGCCTTG GATGGGTTCT TTACCGGATG
GGCCGCTATG ACGAAGCCGT CGGTCACATG GAACGCGCAG TCGAGTTGAT GCCCGTGGAT
CCGGTGGTGA ACGATCACCT CGGAGATGTC TATTGGGCGG TTGGGCGCAA GCTGGAGGCC
GAGTTCCAGT GGCGGCGCGC GCTTTCCTTT GTGGAGCCCG AGGATAAGGA CGCCGAGGCC
AACCCGGATC GCATTCGCCG CAAGCTCGAC GTTGGGCTTG ATGTGGTCCT GGCCGAAGAA
GGGGCAGAAC CGCTTCAGGT TGCGAATGAC GATCACTGA
 
Protein sequence
MPVSYLRSLT CAAALLLTTG PMAVADGLAG AYLAGRAATY ESDFAASAKY YTRALVRDPQ 
NITLMENLVY AQLALGEVES ALPVAERMWQ AGVSSQVANI VMAGNLALQE NYDALLARDS
EQFEISPLVD GLLDAWAYMG KGAVSQALDQ FDAVAQQDGL RFFALYHKAL ALASVGDYEG
ADQLFAANEG QLGRSSRRAA IARIQVLSQL GRNDQALEVL VDSFGEGFDP ALTEFADQLA
MGETLRFSIT PTARDGMAEV FYSLGQALSG EAASDYVLMY ARMAAKLSPG HVDAVLLSAG
LLDQMGRYEL SIATYKQVPR DHPDFHAAEL GRAEALRRSA NPQAAAEVLE QLARDFPQHV
AVYIDLGDLM RQQENYAEAA KAYTRALELS PDETTNRWFL YYARGICNER LKNWEAAEAD
FRAALEIDPD QPQVLNYLGY SLVERQEKLD EALSMIERAV AARPESGYII DSLGWVLYRM
GRYDEAVGHM ERAVELMPVD PVVNDHLGDV YWAVGRKLEA EFQWRRALSF VEPEDKDAEA
NPDRIRRKLD VGLDVVLAEE GAEPLQVAND DH