Gene Information |
Locus tag | Tcur_3012 |
Symbol | |
ID | 8604356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | - |
Start bp | 3497487 |
End bp | 3500810 |
Gene Length | 3324 bp |
Protein Length | 1107 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | carbamoyl-phosphate synthase, large subunit |
Protein accession | YP_003300592 |
Protein GI | 269127222 |
COG category | |
COG ID | |
TIGRFAM ID | |
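The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that Start bp and End bp are both inclusive and that the CDS includes a single terminal stop codon.

```python
# Values copied from the Gene Information table above.
START_BP = 3497487
END_BP = 3500810


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming it ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3324
print(protein_length(length_bp))  # 1107
```

Under those assumptions the results agree with the Gene Length (3324 bp) and Protein Length (1107 aa) fields listed in the table.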
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.541011 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCGCC GCCAAGATCT CAAGTCGGTC CTGGTGATCG GCTCCGGGCC GATCGTCATC GGCCAGGCAT GTGAATTCGA CTACTCCGGA ACCCAGGCGT GCCGGGTGCT CAAGTCCGAG GGGCTGCGCG TCGTCCTGGT CAACAGCAAC CCGGCGACGA TCATGACCGA CCCGGAGTTC GCCGACGCCA CCTATGTCGA GCCGATCACC CCGGACGTGG TCGAGAAGAT CATCGCCAAG GAGCGGCCGG ACGCGCTGCT GCCCACCCTG GGCGGCCAGA CCGCCCTCAA CACCGCGATC GCGCTGCACG AGTCGGGGGT GCTGGAGCGC TACGGGGTGG AGCTGATCGG CGCCGACATC GAGGCCATCC AGGCCGGGGA GAACCGCGAG CGCTTCAAGG AGGTCGTCGC CGCGGTCGCC GCCAAGTACG GCCTGAACGC CGAGTCGGCC CGCTCGGTGA TCTGCCACAG CATGGACGAG TGCCTGGCCG CCGCCGCCGA ACTGGGCTAC CCGCTGGTGG TGCGGCCCTC CTTCACCCTC GGCGGCACCG GCTCGGGCAT GGCCTACAGC GAAGCGGACC TGCGCCGCAT CGCCGGCGCC GGGCTGGCCG CCAGCCCCAC CAGCGAGGTG CTCCTGGAGG AGTCCATCCT CGGCTGGAAG GAGTACGAGC TGGAGGTGAT GCGCGACCGG GCCGACAACG TCGTCATCGT GTGCTCCATC GAGAACCTGG ACCCGATGGG CGTGCACACC GGCGACTCCA TCACCGTCGC CCCGGCGATG ACGCTGACCG ACCGCGAGTA CCAGAACATG CGGGACGTGG CGATCGCGGT GATCCGCGAG GTCGGGGTGG ACACCGGCGG CTGCAACATC CAGTTCGCGG TGCACCCCGA GACCGGGCGG ATGATCGTCA TCGAGATGAA CCCGCGGGTC TCCCGCTCCT CGGCGCTGGC CTCCAAGGCC ACCGGCTTCC CGATCGCCAA GATCGCCGCC AAGCTGGCCA TCGGCTACAC CCTCGATGAG ATCCCCAACG ACATCACCCG CGAGACCCCG GCCAGCTTCG AGCCCACCCT CGACTACGTC GTGGTCAAGG TGCCCCGCTT CGCCTTCGAG AAGTTCCCCG GCGCCGACGC CACGCTGACC ACCCACATGA AGTCGGTGGG CGAGGCCATG GCGATCGGCC GGTCCTTCCC CGAGGCGCTG CAAAAGGCGC TGCGGTCGCT GGAGCAGAAG GGCTCGTCCT TCTCCTGGGA CGGCGAGCCC GGCGACCCGC AGGAGCTGCT GCGCAGCGCC GGCCGGCCGC ACGAGGGCCG GCTGCGCGAT GTGCAGCGGG CGCTGTGGGC CGGCGCCACC GTCGAGCAGG TCCACCGGGC CACCGGCATC GACCCGTGGT TCCTGGAGCA GATCGCCGCC ATCAACGAGG TCGCCGACCA GATCCGCACC GCCGAGGACG CGCTGACCCG CGACAAGCTG CTGACCGCCA AGCGCTACGG CTTCTCCGAC GCCCAGATCG GGCAGTTGCG GGGCCTGCCG GAGGAGGTGG TCCGGGAGCT GCGGCGGGCG CTGGGCGTGC GGCCGGTCTA CCACACCGTG GACACCTGCG CCGCGGAGTT CGCCGCCCGC ACCCCCTACC TGTACTCCAC CTACGACGAG GAGACCGAGG TCCCCACCGG GGACAAGCCC AAGGTCATCA TCCTCGGCAG CGGCCCCAAC CGCATCGGCC AGGGCGTGGA GTTCGACTAC TCCTGCGTGC ACGCTTCCTT CACCCTCTCG GAGGCCGGTT ACGAGACCGT CATGGTCAAC 
TGCAACCCCG AGACGGTCTC CACCGACTAC GACACCTCCG ACCGGCTGTA CTTCGAGCCG CTCACCTTGG AGGACGTGCT GGAGGTGGTG CACGCCGAGC AGCAGACCGG CCCGGTCGCC GGGGTGATCG TCCAGCTGGG CGGGCAGACC CCGCTGGGGC TGGCCCAAAA GCTCAAGGAC GCCGGCGTGC CGATCGTGGG CACCTCGCCG GAGAGCATCC ACCTGGCCGA GGACCGCGGC GCCTTCGGCC GGGTGCTGGA GCGGGCCGGG CTGCCGGCTC CCAAGCACGG CACCGCCACC TCGTTCGAGG AGGCCCGCAA GATCGCCGCC GAGATCGGCT ACCCGGTGCT GGTGCGCCCC TCCTACGTGC TGGGCGGACG CGGCATGGAG ATCGTCTACG ACGACGCCAC GCTGCAGTCC TACATGGCCA AGGCCACCGA GGTCAGCCCC GAGCACCCGG TGCTGGTGGA CCGGTTCCTC GATGAGGCGG TCGAGATCGA CGTCGATGCC CTCTTCGACG GCGAGGAGCT GTACCTGGGC GGGATCATGG AGCACATCGA GGAGGCCGGG ATCCACTCCG GCGACTCGGC CTGCGCCCTG CCCCCCATCA CGCTGGGCCG CGAGGACATC GAGCGGATCC GCACCTCCAC CGAGGCGCTG GCCCGCGGCA TCGGGGTGCG CGGCCTGATC AACGTCCAGT ACGCCCTGTC GGCGGGGGTG CTGTACGTGC TGGAGGCCAA CCCGCGCGCC TCCCGCACCG TCCCGTTCGT CTCCAAGGCC ACCGCGGTCC CCCTCGCCAA GGCCGCCGCC CGGGTGATGA TGGGCGCCAC CATCGCCGAG CTGCGCGCCG AGGGGCTGCT GCCGCGCGAG GGCGACGGCG GCACGCTGCC GCTGGACGCC CCCATCGCGG TCAAGGAGGC GGTGCTGCCC TTCGACCGGT TCCGCAACGC CCAGGGCCAG GGCGTGGACA TCGTGCTCGG CCCGGAGATG CGCTCCACCG GCGAGGTCAT GGGCATCGAC GAGACCTTCG GCACCGCCTT CGCCAAGTCC CAGCAGGCCG CCTACGGGGC GCTGCCGACC AAGGGACGGG CGTTCGTGTC GGTGGCCAAC CGGGACAAGC GCTCGATGGT CTTCCCCGTC AAGCGCCTGG CCGACCTGGG CTTTGAGATC CTGGCCACCG AGGGCACCGC CGAGGTGCTG CGCCGCAACG GCGTGCATGC CAAGATCGTG CGAAAGCACA GCGAAGGGCC CGGTCCCGAC GGCGAGCCCA CCATCGTCCG GCGCGTCCTC GACGGGGAGG TGGACCTCAT CGTCAATACT CCCTTCGGCA GCCCCGGCCA ATCGGGGCCG CGACTGGACG GCTATGAGAT CCGCACCGCC GCGGTGCTGC GGGGCATCCC GTGCGTGACC ACCACGGCCG GACTGGCCGC CGCGGTGCAG GGCATCGAGG CCATCGTCCG CGGCGACCTG GGCGTCCGCT CGCTGCAGGA ACACGCCGAA CGCCTGCGGG CCGCCCGCCG GTGA
|
Protein sequence | MPRRQDLKSV LVIGSGPIVI GQACEFDYSG TQACRVLKSE GLRVVLVNSN PATIMTDPEF ADATYVEPIT PDVVEKIIAK ERPDALLPTL GGQTALNTAI ALHESGVLER YGVELIGADI EAIQAGENRE RFKEVVAAVA AKYGLNAESA RSVICHSMDE CLAAAAELGY PLVVRPSFTL GGTGSGMAYS EADLRRIAGA GLAASPTSEV LLEESILGWK EYELEVMRDR ADNVVIVCSI ENLDPMGVHT GDSITVAPAM TLTDREYQNM RDVAIAVIRE VGVDTGGCNI QFAVHPETGR MIVIEMNPRV SRSSALASKA TGFPIAKIAA KLAIGYTLDE IPNDITRETP ASFEPTLDYV VVKVPRFAFE KFPGADATLT THMKSVGEAM AIGRSFPEAL QKALRSLEQK GSSFSWDGEP GDPQELLRSA GRPHEGRLRD VQRALWAGAT VEQVHRATGI DPWFLEQIAA INEVADQIRT AEDALTRDKL LTAKRYGFSD AQIGQLRGLP EEVVRELRRA LGVRPVYHTV DTCAAEFAAR TPYLYSTYDE ETEVPTGDKP KVIILGSGPN RIGQGVEFDY SCVHASFTLS EAGYETVMVN CNPETVSTDY DTSDRLYFEP LTLEDVLEVV HAEQQTGPVA GVIVQLGGQT PLGLAQKLKD AGVPIVGTSP ESIHLAEDRG AFGRVLERAG LPAPKHGTAT SFEEARKIAA EIGYPVLVRP SYVLGGRGME IVYDDATLQS YMAKATEVSP EHPVLVDRFL DEAVEIDVDA LFDGEELYLG GIMEHIEEAG IHSGDSACAL PPITLGREDI ERIRTSTEAL ARGIGVRGLI NVQYALSAGV LYVLEANPRA SRTVPFVSKA TAVPLAKAAA RVMMGATIAE LRAEGLLPRE GDGGTLPLDA PIAVKEAVLP FDRFRNAQGQ GVDIVLGPEM RSTGEVMGID ETFGTAFAKS QQAAYGALPT KGRAFVSVAN RDKRSMVFPV KRLADLGFEI LATEGTAEVL RRNGVHAKIV RKHSEGPGPD GEPTIVRRVL DGEVDLIVNT PFGSPGQSGP RLDGYEIRTA AVLRGIPCVT TTAGLAAAVQ GIEAIVRGDL GVRSLQEHAE RLRAARR