Gene Tmz1t_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_1628 
Symbol 
ID7084838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp1823370 
End bp1826144 
Gene Length2775 bp 
Protein Length924 aa 
Translation table11 
GC content68% 
IMG OID643698648 
ProductPEP-CTERM system TPR-repeat lipoprotein 
Protein accessionYP_002355279 
Protein GI217970045 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID[TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.57702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTACCC GACCCGAACG TCGCGCAGTG CGCACGATGA CGCTCTGCGC CACGCTCCTG 
CTCGCCGCCT GTGGCGACAG CGGCAGCCCA CAACACTACG TCGCACTCGC CCGCGACCAC
CTTGCCAACG GCGCCTATCG CGAAGCGACC ATCGAGCTGA ACAACGCACT CCAGAAGGAT
CCGAAGAACC GCGAGGCACG CTGGCTGCTG GCGCAGGCGG CGCTCCAGCT CGGCGAAGCC
GACAAGGCCG AACGCGATGC ACGCAAGGCG ATCGAATACG GGTTCTCGCG CACCGAAGCC
CTACCCCTGC TGGCGCGGGC GATCCTGATG CAGCAGGCGC CCGACCGGGT TCTCACCGAA
CTCTCCACCG CCCCGACGGA CGCGCCCGAC ACAATGCAAG CCGAGTATGC AAGTCTGCGT
GGCACCGCGC TCCTGCTCAA GGGAGAACTC GACGCTGCCG AGCCCGAGTT CGGCAAGGCG
CACAAGCTGG ACCCCGCCCT GCCGGAGGCC ATCGTCGGCC TCGCGCTGGC GCAGAGCTTG
CGCAAGCAGT ACGACGAGGC GCGCAAGACG CTTGCACCCG CATTAGAGCG CACGCCGCCG
GTCGCCGACG CGTGGTCGCT GCTCGGCGAC ATTGAAACCG AGCAGGAGCG CTTCGACGCC
GCCGAGACCG CCTTCGGCCA GGCGATCCGG GCGCGTGCCC ATGTCACCCT CGAGCGCGCG
AAGCGCGCCC TCGCCCGCGT GCGCCAAGGC AAGTTCGCAG AGGCCGAGGC GGACCTGAAC
GCACTCGGAG CGCTTTCTCG CCATCCCTAT GCGCAGTACG TCACCGGCCT GTCCCATTTC
CGCCAGCAGC GCCTGCGCGA GGCGGCTGAC GCATTCGAAC TTTCCTTGGC CGCCGATCCG
AATTTCGCAC CCAACCGGGT CTATCTCGCC ATCACCCGGC TGATGCTCGG CCAGCAGGAA
CAGGCGCTGG CGCACGCCGA ATTCATCCGT GCTGCCGCAC CGCAGGCTTC GGGCGCGAAC
CTGCTCCTCG GCATCGCCCA GGCTGGTCAC GCCGACTACG GCCAGGCGCG CAAGACGCTC
GAAGCCGCCT TGGCCTCCGA GCCGGACAAC GTCACCAATT TGCAGTTGCT GGCCACGCTG
AGCCTGCTGC AGGGCGACAG CAAGACCGCA CTCTCCCATG CCCAGCGTCT CGCCACGCTG
CGTCCGGACT CGACGGGCGC CATCAACCTT TTAATGATGG CGCAGTTGAT GTCGGGTGCG
GAAGTGCCCG AGCCGACCGC CGGTCAGGTC GATGCGCTCC AGGCCGAGTT CATGCAGGCC
CTCGAGGCCT TCCGCGACAA GCGCTTCGGC GAAGCGACAA AGCGCGCCGA GGCCCTGCGC
GCAGCGCATC CCGAACAGAT CGGGCCGATC AACCTGCTCG CCGCCCTGTA TCTTTCCACC
GGCCAGTGGC CCAAGGCGCG CAAGGAACTC GAGACCGTCC TGCAACGCCA GCCCGCGGAT
GCGACCGCAC GCATCAACCT CGCCAAGCTC GAACTGCAAG ATCGCAACTT CCAGCGCATT
AAGGAGCTCG TGAGCCCGCT CGTGCTCGCG TCGCCTTCGG CGGAAGCGCC CGCGCTCCTG
CTTGTGGCCG CCGAGCATGG GCTGCAGAAC GACGTCGCCG CGGACCAGGT GCTTGAGCAG
TTGGTCAAGA GCAACCCCTC GGCCACGCTT GCACGTGCCC TGGCCGCAGG CCGTGCGCTA
CGAGGCAACA AGCCGGAACG CACGCTCGAA TACCTTGCCC AGTTCGACAA GGCACGCATC
GAGTCCTCGC CGGCGCTCCT CGAACTGCGC GGGCGCGCTC ACCTGGCCCT CGCCCAGAAC
GCCGAAGCCC TCGGCAACTT CGAGCGCTGG GCGCACTTGG CCCCCGAATC CGCTGCAGCG
CACTTCCTGC ATGCCGAAAC CCTCGCTCGC CTGGGACGCA TCCGTGACGC CGAGGGCGCG
CTGGTCCGGG CCGTCAAGCT CGACCCGACC AACCCCGAGG TGCGCATCGC CGAGGTACGC
ATGCTGACCC GGACGGGACA ACTCGACAAG GCAAAGAGCG CTGCGCAGCG CCTGCGCAAG
GATTTCGGCG ACCGCCCGGA CATCCTCGCC ACCACCGGCT GGCATGCATT GATGACCAGC
GAGTTCGCCA TCGCGGCCGA CCACCTCGGG CGTGCATTCG CGCAGACGCC GAGCACAGCC
CTCCTCCTCG AGAGGATGGC CGCGTTGTGG GGCATGGAGA AGCGGGATGA AGCCCTCGCG
CTCATCCGGG ACTGGCTGGC CGAACACCCG CAGGACAGCG CAGCGCTGCT CCAGCTTGCC
GGCGCATATC TGGAGCTCGG CCAGGACGAG GAGGCAGTCC GCATCTATCG CAAGGTGCTG
GAGCTCGACC CGGCGCACGT CCCGTCGCTC AACAACGTCG CTTGGCTGTT GCGCCGCAGC
CGGCCGGACG AAGCCCTTCA AACGGCCCGT CGCGCCCTCG AGCTTGCCCC CAAGGACCCG
AACGTGCTCG ATACCGTCGG CATGTTGTAT CTGGACCGCA GGGACCTGAC GCAGGCAGGG
TGGTACGTCG GCAAGGCGCA TGAGCAGAAC CCGCGCAACC ACCAGATCAG CCTGCACCTC
GCCGAGGTCG CACACGCCAA GGGCAACACC GCAGAGGCGC TCAAGCTCAT CGACGGCGTG
CTTGCAGAGG CGCGCGATCC TGCACTTCAC AAACAGGCCG AGAGCCTGCG TGCAACGCTC
GAGAGCGGGC GCTGA
 
Protein sequence
MPTRPERRAV RTMTLCATLL LAACGDSGSP QHYVALARDH LANGAYREAT IELNNALQKD 
PKNREARWLL AQAALQLGEA DKAERDARKA IEYGFSRTEA LPLLARAILM QQAPDRVLTE
LSTAPTDAPD TMQAEYASLR GTALLLKGEL DAAEPEFGKA HKLDPALPEA IVGLALAQSL
RKQYDEARKT LAPALERTPP VADAWSLLGD IETEQERFDA AETAFGQAIR ARAHVTLERA
KRALARVRQG KFAEAEADLN ALGALSRHPY AQYVTGLSHF RQQRLREAAD AFELSLAADP
NFAPNRVYLA ITRLMLGQQE QALAHAEFIR AAAPQASGAN LLLGIAQAGH ADYGQARKTL
EAALASEPDN VTNLQLLATL SLLQGDSKTA LSHAQRLATL RPDSTGAINL LMMAQLMSGA
EVPEPTAGQV DALQAEFMQA LEAFRDKRFG EATKRAEALR AAHPEQIGPI NLLAALYLST
GQWPKARKEL ETVLQRQPAD ATARINLAKL ELQDRNFQRI KELVSPLVLA SPSAEAPALL
LVAAEHGLQN DVAADQVLEQ LVKSNPSATL ARALAAGRAL RGNKPERTLE YLAQFDKARI
ESSPALLELR GRAHLALAQN AEALGNFERW AHLAPESAAA HFLHAETLAR LGRIRDAEGA
LVRAVKLDPT NPEVRIAEVR MLTRTGQLDK AKSAAQRLRK DFGDRPDILA TTGWHALMTS
EFAIAADHLG RAFAQTPSTA LLLERMAALW GMEKRDEALA LIRDWLAEHP QDSAALLQLA
GAYLELGQDE EAVRIYRKVL ELDPAHVPSL NNVAWLLRRS RPDEALQTAR RALELAPKDP
NVLDTVGMLY LDRRDLTQAG WYVGKAHEQN PRNHQISLHL AEVAHAKGNT AEALKLIDGV
LAEARDPALH KQAESLRATL ESGR