Gene Tmz1t_2921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_2921 
Symbol 
ID7873823 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp3161918 
End bp3163270 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content65% 
IMG OID643699842 
Productflagellar basal body FlaE domain protein 
Protein accessionYP_002889897 
Protein GI237653583 
COG category[N] Cell motility 
COG ID[COG1749] Flagellar hook protein FlgE 
TIGRFAM ID[TIGR03506] fagellar hook-basal body proteins 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.926176 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTTCC AGCAAGGTTT GAGCGGCCTG AGCGTTTCCT CCAAGGCCCT GGACATCATC 
TCCAACAACG TCGCCAACAC CAACACGATC GGCTTCAAGA CCGGCTCCGC GATCTTCTCG
GACGTCTTTG CGGCCTCGCT GACCGGCTCG ATCTCGGGCA AGCAGGTGGG TGTGGGCGCC
ACGCTCGGTG CGGTGCGGCA GACCTTCACC CAGGGCAACC TCACCACCAC CAACAACCCG
CTCGACCTGG CGATCAACGG CGACGGCTTC TTCGTGGTCA GCCGCGAGAA CGGCCCGGAC
GTGTATACCC GCAACGGCCA GTTCGAGCTC GACAAGGAGG GTTTCATCCG CACCCCCACC
GGCGAAAGGC TCAAGGGCTT CCAGAGTCCG GTCACCCCGG GCGCCCTGCC CACGCCGATC
GGCGGCCTCA AGGACGTCCA GGTGCCGATC GTCGGTTCCC AGCCCATGCC GACCAGCCGG
GTGGAGATCG GCGGAAACCT CGATGCGAAC GACCTGAGCG CAGCCCAGCG CTACCCACAG
CTGTTCGGCT CGACCTTCAC GTTTCCGCTC CCTGCGGGCG AGAACTGGCA GGACGCGCGC
ACCTACAATT TCTCGACCTC GATCAAGGCG TACGACAGCC TCGGGAAGTC GCACGAGCTC
ACCTACTACT TCGCCAAGGT GACCCCGACC CCGAACGATC CGGTCAACAC GAACGAGAAC
ACCTGGAAGG TCTTCACCAG CGTGGACGGC GGCTATCCCT TGGGGGTCGG GACCGATGGC
TCGATCGAGG ACCCGAGCGT GATCTGGAGC CTGACCTTCA ACGAGAAGGG TACGGTGAAC
ACGCCCATCG TCGGGCCGAC GGCTGCGAAT CCGCTGGCGA TCCCGGCTTC CTACACGCCG
GGTGCGGACC GCATCCGTTT CCACGTCGAT TTCAGCGACA TGCGCCAGAT CGGTGGCTCC
TACCTGATCA CCGAGCTCAC CCAGGACGGC TACACCGGCG GCGAGCTCGC CGGAGTCAGC
GTGGGACGGG ACGGCATCGT GACCGGTCGC TACACCAATG GCGAGACGCG CGAACTCGCC
CAGCTCGCGC TGCAGACCTT CCGCAATCCG AACGCGCTGC TGTCGATCGG CAACAACTTC
TGGGAAGCGA CCACCGAATC CGGCCTCACT GCGCCGTCCA AGGCCGGCGA AGGGGTCGCG
GGCGTGGTCT CGGCGGGCAT GATCGAGGAC GCCAACGTGG AGCTCACCAA CGAGCTGGTG
CAGATGATCG TGCAGCAGCG CAACTACCAG GCCAACGCGC AATCCATCAA GGCGCAGGAC
CAGGTCCTGC AGACCCTGGT GAACCTGCGC TGA
 
Protein sequence
MSFQQGLSGL SVSSKALDII SNNVANTNTI GFKTGSAIFS DVFAASLTGS ISGKQVGVGA 
TLGAVRQTFT QGNLTTTNNP LDLAINGDGF FVVSRENGPD VYTRNGQFEL DKEGFIRTPT
GERLKGFQSP VTPGALPTPI GGLKDVQVPI VGSQPMPTSR VEIGGNLDAN DLSAAQRYPQ
LFGSTFTFPL PAGENWQDAR TYNFSTSIKA YDSLGKSHEL TYYFAKVTPT PNDPVNTNEN
TWKVFTSVDG GYPLGVGTDG SIEDPSVIWS LTFNEKGTVN TPIVGPTAAN PLAIPASYTP
GADRIRFHVD FSDMRQIGGS YLITELTQDG YTGGELAGVS VGRDGIVTGR YTNGETRELA
QLALQTFRNP NALLSIGNNF WEATTESGLT APSKAGEGVA GVVSAGMIED ANVELTNELV
QMIVQQRNYQ ANAQSIKAQD QVLQTLVNLR