Gene Tmz1t_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_4035 
Symbol 
ID7873680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4432684 
End bp4433904 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content56% 
IMG OID643700971 
Productplasmid encoded RepA protein 
Protein accessionYP_002890994 
Protein GI237654680 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTTAACA GCCGCTCGCA CGCACAAGAG GAGACAAAAA AGAACAATCG CAAACGGGTA 
CGTCTTTCTG CCACAGCACT GGGTCTTGGC TTGACGGCGC TAATGTCCTT GAGCAGCGCT
AACGCTGAAG ACAAAGTCAA AGTTGGCTTG ATGCTTCCGT ATACGGGCAC TTATGCCGCA
CTCGGCACCG CAATCACCAA CGGTTTTAGA CAGTATGTGA ACGAACATGG TGGTAAGCTC
GCTGGACGCG AGGTAACGTA TTTCGTGGTC GATGACGAGT CCGATCCAGC CAAAGCAACC
GAAAATGCTA ATCGTCTTGT CAAGCGGGAC GAAGTCGATG TCTTGGTAGG AACAGTCCAT
TCCGGTGTTG CACTAGCGAT GGCAAAGGTA GCCCGCGACA ACAAGACCTT GACCATCATC
CCGAATGCGG GCGCCGACGA ACTCACTGGC CAGCTGTGCG CCCCGAACGT TTTCCGAACT
TCGTTTTCAA ATTGGCAGCC GGCGTTTGCC ATGGGCAAAG TCATGGCTGC AAGAGGGCAC
AAAAAGGTGG TCACCCTGAC CTGGAAATAC GCGGCTGGCG AGCAATATGT GCGGGGCTTC
AAGGAAGCTT TCGAAAAGGA AGGTGGCCAA GTGATTAGCG AGTTGTACCT GCCTTTCCCC
GGCGTGGAGT TTCAGCCCTT CCTGACGCAG ATCAGCAGCC TCGGCGCAGA CGCGGTGTAC
GTTTTTTTTG CCGGCAGCGG AGCTGCCAAG TTTGTTAAGG ACTATGAAGC CGCAGGACTC
AAGGCAAATC TTCCGCTTTA CGGAACGGGC TTCCTGACTG ACGGGACGCT GGAGGCAATG
GGTGGCGCGG GTGAAGGACT ACTGACCACG CTGCACTACG CTGACGGCCT CGAAAATCCG
GTAGACAAGG CGTTCCGAGC CGGGTACGTC TCAGCCCACA AGGTGCAGCC TGACGTCTAC
GCCGTGCAGG GATATGATGC AGCCCAGTTG TTGGCGGCGG GTCTGGCGGG CTCTCCTTCA
GGCAAGTTTG ACAAAGAGGC CGTTATGAAG GCGATGAGCG CGGCGAAGAT TGAAAGTCCA
CGCGGCAGCT TTAATTTGTC GGCAGCCAAC AACCCGGTGC AGGATATTTA CCTACGCAGG
GCCGAAGGTA AACAGAACAC GATCGTGGAG ATTGCGGTTC CCAAGCTTGC CGACCCTGCG
CGCGGTTGCC GCATGAATTG A
 
Protein sequence
MLNSRSHAQE ETKKNNRKRV RLSATALGLG LTALMSLSSA NAEDKVKVGL MLPYTGTYAA 
LGTAITNGFR QYVNEHGGKL AGREVTYFVV DDESDPAKAT ENANRLVKRD EVDVLVGTVH
SGVALAMAKV ARDNKTLTII PNAGADELTG QLCAPNVFRT SFSNWQPAFA MGKVMAARGH
KKVVTLTWKY AAGEQYVRGF KEAFEKEGGQ VISELYLPFP GVEFQPFLTQ ISSLGADAVY
VFFAGSGAAK FVKDYEAAGL KANLPLYGTG FLTDGTLEAM GGAGEGLLTT LHYADGLENP
VDKAFRAGYV SAHKVQPDVY AVQGYDAAQL LAAGLAGSPS GKFDKEAVMK AMSAAKIESP
RGSFNLSAAN NPVQDIYLRR AEGKQNTIVE IAVPKLADPA RGCRMN