Gene Information |
Locus tag | Tmz1t_4035 |
Symbol | |
ID | 7873680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4432684 |
End bp | 4433904 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643700971 |
Product | plasmid encoded RepA protein |
Protein accession | YP_002890994 |
Protein GI | 237654680 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTTAACA GCCGCTCGCA CGCACAAGAG GAGACAAAAA AGAACAATCG CAAACGGGTA CGTCTTTCTG CCACAGCACT GGGTCTTGGC TTGACGGCGC TAATGTCCTT GAGCAGCGCT AACGCTGAAG ACAAAGTCAA AGTTGGCTTG ATGCTTCCGT ATACGGGCAC TTATGCCGCA CTCGGCACCG CAATCACCAA CGGTTTTAGA CAGTATGTGA ACGAACATGG TGGTAAGCTC GCTGGACGCG AGGTAACGTA TTTCGTGGTC GATGACGAGT CCGATCCAGC CAAAGCAACC GAAAATGCTA ATCGTCTTGT CAAGCGGGAC GAAGTCGATG TCTTGGTAGG AACAGTCCAT TCCGGTGTTG CACTAGCGAT GGCAAAGGTA GCCCGCGACA ACAAGACCTT GACCATCATC CCGAATGCGG GCGCCGACGA ACTCACTGGC CAGCTGTGCG CCCCGAACGT TTTCCGAACT TCGTTTTCAA ATTGGCAGCC GGCGTTTGCC ATGGGCAAAG TCATGGCTGC AAGAGGGCAC AAAAAGGTGG TCACCCTGAC CTGGAAATAC GCGGCTGGCG AGCAATATGT GCGGGGCTTC AAGGAAGCTT TCGAAAAGGA AGGTGGCCAA GTGATTAGCG AGTTGTACCT GCCTTTCCCC GGCGTGGAGT TTCAGCCCTT CCTGACGCAG ATCAGCAGCC TCGGCGCAGA CGCGGTGTAC GTTTTTTTTG CCGGCAGCGG AGCTGCCAAG TTTGTTAAGG ACTATGAAGC CGCAGGACTC AAGGCAAATC TTCCGCTTTA CGGAACGGGC TTCCTGACTG ACGGGACGCT GGAGGCAATG GGTGGCGCGG GTGAAGGACT ACTGACCACG CTGCACTACG CTGACGGCCT CGAAAATCCG GTAGACAAGG CGTTCCGAGC CGGGTACGTC TCAGCCCACA AGGTGCAGCC TGACGTCTAC GCCGTGCAGG GATATGATGC AGCCCAGTTG TTGGCGGCGG GTCTGGCGGG CTCTCCTTCA GGCAAGTTTG ACAAAGAGGC CGTTATGAAG GCGATGAGCG CGGCGAAGAT TGAAAGTCCA CGCGGCAGCT TTAATTTGTC GGCAGCCAAC AACCCGGTGC AGGATATTTA CCTACGCAGG GCCGAAGGTA AACAGAACAC GATCGTGGAG ATTGCGGTTC CCAAGCTTGC CGACCCTGCG CGCGGTTGCC GCATGAATTG A
|
Protein sequence | MLNSRSHAQE ETKKNNRKRV RLSATALGLG LTALMSLSSA NAEDKVKVGL MLPYTGTYAA LGTAITNGFR QYVNEHGGKL AGREVTYFVV DDESDPAKAT ENANRLVKRD EVDVLVGTVH SGVALAMAKV ARDNKTLTII PNAGADELTG QLCAPNVFRT SFSNWQPAFA MGKVMAARGH KKVVTLTWKY AAGEQYVRGF KEAFEKEGGQ VISELYLPFP GVEFQPFLTQ ISSLGADAVY VFFAGSGAAK FVKDYEAAGL KANLPLYGTG FLTDGTLEAM GGAGEGLLTT LHYADGLENP VDKAFRAGYV SAHKVQPDVY AVQGYDAAQL LAAGLAGSPS GKFDKEAVMK AMSAAKIESP RGSFNLSAAN NPVQDIYLRR AEGKQNTIVE IAVPKLADPA RGCRMN
|
| |
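The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the function names `gene_length` and `protein_length` are hypothetical. The gene sequence shown above starts with TTG, which translation table 11 permits as a start codon, and ends with TGA, which is consistent with those assumptions.

```python
# Values copied from the record above.
START_BP = 4432684
END_BP = 4433904


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1221, matches "Gene Length"
print(protein_length(length_bp))  # 406, matches "Protein Length"
```

Both results agree with the Gene Length (1221 bp) and Protein Length (406 aa) fields listed under Gene Information.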