Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0370 |
Symbol | |
ID | 7084876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | - |
Start bp | 419038 |
End bp | 420036 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643697400 |
Product | integrase family protein |
Protein accession | YP_002354048 |
Protein GI | 217968814 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCGA CGCCCAGTTT CCCTGCGTTG TTGCAGCGCT TCTTTACCGA CCGGCTGATG CAGCAGCGTC GGGCAAGCCC GCATACGATC GGCTCCTATC GGGATACCTT TCGTCTGCTG CTGCGCTTTG CCCAGATGCG ACTCGGCAGG CAGCCCTCGC AGTTGGCGTT CGAGCAGATC GATGCGCCAC TGATCGCCGC TTTCCTCGAC GAGCTCCAGG ACAAACGCGG GATCACGGCC CGCAGCCGCA ACCTACGCCT CACCGCGATT CGCTCGTTCT TCCGCTATGC CGCTTTCGAG GCGCCGACCC ATGCGGAGCA GATTCAGCGG GTGCTCGCGA TTCCCAGCCA GCGTTTCACG CGTACCCAGG TCGGGTTCCT GACCCGACCG GAAGTCGACG CGCTGCTGGC CGCGCCCGAT CGGCAGACGT GGTCCGGACG CCGGGACCAC GCGCTGCTCC TGCTCGCAGT TCAAGCCGGA CTACGCCTGT CCGAGCTGAC CGCGATGCGG CGAGATACGG TGATCCTGGG ATCCGGTGCC CACGTCCTGG TCATGGGGAA GGGCCGCAAG GAACGCGCCA CACCGCTGAC CCGACAGACC GCTGCCGTCC TCAATGCCTG GTTGAAGGAG ATTCCCGCCA ACCCCGACGC GACAATATTT CCGAGTGCCC GTGGCACTCG CCTGAGCGCG GATGGCGTGC AGTATCTGCT CGCCAAACAC GTCGCGGTGG CGGCCCGGAG CTGCCCGTCC CTGGCCCAAA AGCGGGTCAC ACCGCACGTG CTTCGCCATA CCACCGCGAT GGAGTTGCTT CAGGCGGGTG TCGATCGTGC GGTAATCGCG CTGTGGCTGG GCCACGAGTC CGTGGAGACG ACGCAGATCT ATCTGGACGC GAACCTCGCC ATCAAGGAGC AAGCCTTGGC CCGGACTACA CCACCCGACA GCACCCCTGG GAGATTTCGA CCGGATGACC AACTGCTCGC CTTCCTCACA GAACTCTGA
|
Protein sequence | MSATPSFPAL LQRFFTDRLM QQRRASPHTI GSYRDTFRLL LRFAQMRLGR QPSQLAFEQI DAPLIAAFLD ELQDKRGITA RSRNLRLTAI RSFFRYAAFE APTHAEQIQR VLAIPSQRFT RTQVGFLTRP EVDALLAAPD RQTWSGRRDH ALLLLAVQAG LRLSELTAMR RDTVILGSGA HVLVMGKGRK ERATPLTRQT AAVLNAWLKE IPANPDATIF PSARGTRLSA DGVQYLLAKH VAVAARSCPS LAQKRVTPHV LRHTTAMELL QAGVDRAVIA LWLGHESVET TQIYLDANLA IKEQALARTT PPDSTPGRFR PDDQLLAFLT EL
|
| |