Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_4394 |
Symbol | |
ID | 8605754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 4992843 |
End bp | 4994459 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003301959 |
Protein GI | 269128589 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.497016 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGGCACG GCGCGGCGGG CGGCTATACG CACCGGCAGG TGCTCAAGAT CCTCAGCGGG CTGCTGATGG CCATGATCAC CGCGGTGATC TCCACCTCGG TGACCACCAT CGCGCTGCCC ACCATCATGG GCGAGCTGGG CGGGCAGGAA CAGCTCGCCT GGGTGGCCAG CGCCCCGCTG CTGGCCATGA CCGCCTCCAC CCCGCTGTGG GGCAGGCTGT CGGACATCTT CGGCCGCAAG CGGATGTACC AGACCGCGCT GCTGCTGTTC ACCGCCTCCT CGATCGCGGC CGGCCTCTCC CAGAACGTCG GCCAGCTCAT CGCGTTCCGG GCGCTGCAGG GGATGGGCGC CGGCGGGGCG ATGGCGCTCA CCCAGGTCAT CCTCGGCGAC ATCGTCGAAC CCCGGCAGCG CGGACGCTAC TCCGGCTACC TGGGCGCCGC CTACGGCCTG TCCACCGTGA CCGCCCCGCT GCTCGGCGGC TTCCTGGTGG ACGCTCCGCG CCTGGGCTGG CGCTGGTGCT TCTTCGTCAC CGTGCCGCTG GCGCTGGCGG CCCTCGCCAT CACCCAGTGG ACCCTGCACC GGCCGCCGCG CGAGACGGGC CGGGCCCGGC CGAAGATCGA CTGGGCGGGG GCGGTCACCA TCACCGGCGC GGCGAGCACC GCGCTGATCG TGCTGTCCCT GGGCGGCAAG GAGTTCCCCT GGAACTCTCC GTGGACCTAC GGCCTGACCG CGCTGACCCT GGTGCTGCTG GGGGCGGCGG TGGCGGCCGA ACGGCGGGCC GCCGCGCCCA TCCTGCCGCC CCGCCTGTTC GCCGACCGCA CCTTCGTGCT CGCCTCGGCG GCCTCGCTGA TGGTGGGCGT GGGCATGTTC GGGGTGATGA CCTACCTGCC GCAGTACCTG CAGGTGGTGC AGGGCATGAC ACCCACCGTC TCCGGGCTGC TGGCCCTGCC GATGACGATC GGCGTCCTGC TGGGCAGCAC CGTCTCCGGG CAGGTGGCCG CCCGCACCGG GCGCTGGAAG ATGTTCCCGG TGACCGGGAC GGCGCTGCTG GCGGCCGGGA TGTTCCTGCT GTCGCGGCTG CAGGTGGACT CCGGTCCGGT GGCGGTCGGC ATCGGCTGCG GGACGGCCGG GCTGGGCCTG GGCATGACCA TGACGATGCT GGTGCTGGCC ACGCAGAACG CCGCCGACCG CGCCGATATG GCCGCCGCCA CCTCGGGCGT CACCTTCTTT CGCAGCATGG GCGGCGCGGT CGGGGTGGCC GCGCTCGGCG CCGTGCTCAC CGCCCGGCTC ACCGCCGCGC TGACCGAGCG GCTGCGCGCC GCCCGCCTGC CGGTGCCCGA GCGGGTCGGC ACCGGGCTGG GCACCCCCGA GGAGATCCAC GCCCTGCCCG AGCCGCTGCG GGGCCTGCTG CTGGCGTCCT TCACCGAGGC GGTCCAGGCC GCCTTCCTGG TCGGGGTGCC GATCGCGCTG GCGGGCCTGG CGGCCGCGCT GGCCCTCAAG GAACTGCCGC TGCGCACCTC CTCCGCCCCC GCCGAGGACC GGCGGCCCCC GGCCGCGGCG GCACCGCCCG CCGATGCCGG GCGGGCCGCG GGCGCGGTGA GCCACCGGCG GACTTGA
|
Protein sequence | MGHGAAGGYT HRQVLKILSG LLMAMITAVI STSVTTIALP TIMGELGGQE QLAWVASAPL LAMTASTPLW GRLSDIFGRK RMYQTALLLF TASSIAAGLS QNVGQLIAFR ALQGMGAGGA MALTQVILGD IVEPRQRGRY SGYLGAAYGL STVTAPLLGG FLVDAPRLGW RWCFFVTVPL ALAALAITQW TLHRPPRETG RARPKIDWAG AVTITGAAST ALIVLSLGGK EFPWNSPWTY GLTALTLVLL GAAVAAERRA AAPILPPRLF ADRTFVLASA ASLMVGVGMF GVMTYLPQYL QVVQGMTPTV SGLLALPMTI GVLLGSTVSG QVAARTGRWK MFPVTGTALL AAGMFLLSRL QVDSGPVAVG IGCGTAGLGL GMTMTMLVLA TQNAADRADM AAATSGVTFF RSMGGAVGVA ALGAVLTARL TAALTERLRA ARLPVPERVG TGLGTPEEIH ALPEPLRGLL LASFTEAVQA AFLVGVPIAL AGLAAALALK ELPLRTSSAP AEDRRPPAAA APPADAGRAA GAVSHRRT
|
| |