Gene Tcur_4135 details

Gene Information

Locus tag: Tcur_4135
Symbol:
ID: 8605491
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermomonospora curvata DSM 43183
Kingdom: Bacteria
Replicon accession: NC_013510
Strand:
Start bp: 4719168
End bp: 4722176
Gene Length: 3009 bp
Protein Length: 1002 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003301701
Protein GI: 269128331
COG category:
COG ID:
TIGRFAM ID:
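
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for a complete CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(4719168, 4722176)
print(length, expected_protein_length(length))  # 3009 1002
```

Both results agree with the Gene Length and Protein Length fields of the record.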


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACCAC GCGATGCAGT CGGGATCCGC AGGCGCAGCG CCGGGGAGAT ACTGGCCGGG 
CTGGGGGCGC TGGTGGCGCT GTCGGCGCTG GTCGTCGGGG TGCCGTGCGC GCTGCTGCTG
GCCTTCGGCT CGCCGCTGCC GGAGCGGATG CCCACCCTGG AGGACCTGAC CGGCCGGATC
GGCCCCTCGG CGATCATCAC CGTGCTGGTC ACCCTGGTGT GGCTGGCCTG GCTGCAACTG
GTGGTCTGCG TCCTGGTGGA GGTGCACGCC GGGATCCGCG GCGTCGGGGT GCCGGTCCGG
GTGCCGCTGG CCGGCGGGCT GCAGCCGCTG GTGCACCGGC TGGTGCTGGC CGCGCTGCTG
CTGTTCACCA CCGCCAGTGC GGTCATGCCC GCCTTCTCGG GCCGGGACAT GCCCTCCACC
GCGCCGGTGG CCGCGGTGCA GACCGACCAC TTTCAGCTGG TGGCGGCCAC CACGCCGCTT
GCGCGGGCCG AGCCGTCCGG CACGGCCGAA GAAGGGGCGC GCGCCCTCGC CGAGAGCGTC
GCGGCCGAAC CGACCGAAAA GACGCAGACC ACCAAGATCT ACCGGGTGCA GCCGCCGCAG
GGCCGCCACC ACGAGAGCCT GTGGGAGATC GCCGAGCGGT GCCTGGGCGA GGGCCGCCGC
TACAAGGAGA TCTACGAGCT GAACAAGGGC CGGGTGCAGC CCGACGGCAG CAAGCTGACC
TACGCCAGCC TCATCCGGCC CGGCTGGATC CTGGAGATGC CCGCCGACGC CGTCGGCGTC
CAGGTCGTGC CGGTCGAAGA CCTGGAGGAG TACTTCCGCT ACGGGCACCC CAAGCCCGAC
CCCGAACCCC GCGGCCAGGA CGGGCGGGAA ACACCCCGGC AGGCCCCCGA AGAGGCCCCG
CCCGCACCGT CTGCTCCGCC TTCGCAACCA CCCGCCGCAC AGACCCCCGA GCCGCCCGCT
GAGCAGACCC CTGCGCCGCC CGCCGAACAG GCCCCGCAGC CGCCGGCCGC GCAGGCCCCG
CAGCGTCCCG CCGAGCAGGC CCCCGCCCGG CCGGACGAGC CGGCCCAGCA GCCGCACGCC
CACCGGCTCG AACCCGCCGA CGAGGGGCCG CCGCCGGTCA CCGGGCTGTC GCTGCCGTCG
ATCGACCTGG GCTGGCCGCA CGGGCTGGCG GCGGCCTCGC TGCTGGCCGC CGGCGTGCTG
ACGGCGCTGG GCCGGCGGCG CCGCGAGCAG ATGTGGCACC GCGGATTCGG CACCATGATC
GCCCGTCCCG AGGGGGAGGC CGCCCGCGCC GAGGAGGCGC TGCGCGTCGG CCAGGACCCC
GAGGGAGCCC AGCTGCTGGA CCTGAGCCTG CGCCAGCTGT CACGGGCGCT GGCCGACCGG
GGCCGCACCC TGCCCACCGT CTACGGCGTC CACCTGGGCG CCGAGAGCCT GGACCTGTGG
GTCGCCCCCG CCGACCCCAA CCCGCCCGAG CCCTGGCGCG CCTTCGACGA CGGGCAGGTG
TGGCGGCTGT CGGCCGACAC GCTGCCCGCC CTGCGGGAGG CCGCGCTGGG CGACGTGCTG
GCCCCCTACC CGGGCCTGGT GTCCATCGGC ACCAACGCCA ACGGCCGGAT CCTGGTGGAC
CTGGAGGCCG CCCAGGGGCT GATCGCGGTG CGCGGCCCGG AGGAGACCCG GCGGGCGGCG
CTGGCCGCCA TCGCCCTGGA GCTGGCCACC AACCGCTGGT CGGACCACAT GCGCATCACC
CTGGTCGGCT TCGACCCCGA CCTGGCGCGC AACCTCGCCG AGATCGCCCC CGACCGCATC
CGCACCGTGG CCTCCCTGCA GGAGGCGCTG CCGGAGCTGG AGGGCCGCAG CGAGGAGGTG
CGCCAGGCGC TGGCCGCCTC CGGCGCCGAC TCGGTGCTGA CCGGCCGCTG CCGCGGCGTG
TTCGGCGAGG CATGGATGCC GCACTACCTG ATCATGGCCG ACCAGCCCAC CGACGCCGAG
ACGGCCCGCC TGGTGGCGCT GGCCCGCACC GGCCGGCGCA TGGCCTCCGG CTACCTGGTC
GCCGGCGAGG TGCCGGGCGC CACCTGGACC TGGGACGTCA CCGCCGACGG CCGGCTGCAC
GCCGGGGTGC TGGGCTTCGA CGTGCAGGCC CAGCTGGTGC GCCCCGAGCA CTACCAGGCG
GTGGCCGACC TGTTCCGCAC CGCCTCCCGC ACCCAGGGCG CTCCGCTGCC GGGCCCCGCC
GACGGGGAGC AGCCGCCGTC CTTCGACCAC CGTCCCGAAG TGGACATCCG GCTGCTCGGC
CCGATCGAGG TGGACGCGCC GGGCCCGATG GACGAAAGCC GCCGGGCGCT GTGCACCGAG
GTGCTGGTGT ACCTGGCCAC CCATCCCGGC GGGGTGCACC CCACGGTGCT GAGCGGGGCG
ATCTGGCCGC GCGGCGTCAG CGCCGGGGTG CGCGACGCCT GCATCGCCCG CGTCTCCGAC
TGGCTGGGCC GCGACTCGCG CGGCCGTCCC AACCTCTACT ACGACGAGCG CGGACGCATC
CGGCTCGGCT CGGAGGTGCG GGTGGACTGG TCGGTGTTCC GCTGGCTGGT GTGGCGTTCG
GCGGCCGAAC CGGCCTCCGA GACCGCCTAC CTGTCCTACG CCCTGGACCT GGTGCGCGGC
CCGCTGCTGG CCGACCGGCC GCGCGGCCGC TACGGCTGGC TGGCCGCCGA CCAGCTGGAG
TACGAGGCCA CCGCCCGCGT CATCGACGTC GCCCACCGCC TGGCCGTGCT GCGGCTGGAG
GAGGGCGACG CCCACGGCGC GGTGAACGCC GCCCGGGCCG GGCTGCGCAT GGTCCCCGAC
GACGAGGGCC TGTGGCGCGA CCTGCTGCGC GCCACCCACG CCACCGGCGA CGCCACCCAG
GTGCAGGTGG TGGTGGACGA GCTGCGCCGC AGGCTCGGCC GCGACCCGCT GATGGACCAC
CTGCAGCCGG AGACCGAGGC CCTCATCGAG GAGCTGGTGC CGCACTGGCG TCAGGTCGCC
CACCGGTGA
 
Protein sequence
MSPRDAVGIR RRSAGEILAG LGALVALSAL VVGVPCALLL AFGSPLPERM PTLEDLTGRI 
GPSAIITVLV TLVWLAWLQL VVCVLVEVHA GIRGVGVPVR VPLAGGLQPL VHRLVLAALL
LFTTASAVMP AFSGRDMPST APVAAVQTDH FQLVAATTPL ARAEPSGTAE EGARALAESV
AAEPTEKTQT TKIYRVQPPQ GRHHESLWEI AERCLGEGRR YKEIYELNKG RVQPDGSKLT
YASLIRPGWI LEMPADAVGV QVVPVEDLEE YFRYGHPKPD PEPRGQDGRE TPRQAPEEAP
PAPSAPPSQP PAAQTPEPPA EQTPAPPAEQ APQPPAAQAP QRPAEQAPAR PDEPAQQPHA
HRLEPADEGP PPVTGLSLPS IDLGWPHGLA AASLLAAGVL TALGRRRREQ MWHRGFGTMI
ARPEGEAARA EEALRVGQDP EGAQLLDLSL RQLSRALADR GRTLPTVYGV HLGAESLDLW
VAPADPNPPE PWRAFDDGQV WRLSADTLPA LREAALGDVL APYPGLVSIG TNANGRILVD
LEAAQGLIAV RGPEETRRAA LAAIALELAT NRWSDHMRIT LVGFDPDLAR NLAEIAPDRI
RTVASLQEAL PELEGRSEEV RQALAASGAD SVLTGRCRGV FGEAWMPHYL IMADQPTDAE
TARLVALART GRRMASGYLV AGEVPGATWT WDVTADGRLH AGVLGFDVQA QLVRPEHYQA
VADLFRTASR TQGAPLPGPA DGEQPPSFDH RPEVDIRLLG PIEVDAPGPM DESRRALCTE
VLVYLATHPG GVHPTVLSGA IWPRGVSAGV RDACIARVSD WLGRDSRGRP NLYYDERGRI
RLGSEVRVDW SVFRWLVWRS AAEPASETAY LSYALDLVRG PLLADRPRGR YGWLAADQLE
YEATARVIDV AHRLAVLRLE EGDAHGAVNA ARAGLRMVPD DEGLWRDLLR ATHATGDATQ
VQVVVDELRR RLGRDPLMDH LQPETEALIE ELVPHWRQVA HR
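
The sequences above are printed in blocks of 10 characters, 60 per line. The following is a minimal sketch of how such a listing might be turned back into a plain string, translated, and checked for GC content. It rests on several assumptions: the helper names are hypothetical, the codon assignments are those of the standard genetic code (which translation table 11 shares for elongation), and the special handling of alternative start codons under table 11 is not implemented. It is not the tool used to produce this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on bases other than A, C, G, T (for example N).
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCACCAC GCGATGCAGT CGGGATCCGC"))  # MSPRDAVGIR
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree.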