Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gobs_3001 |
Symbol | |
ID | 8754674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geodermatophilus obscurus DSM 43160 |
Kingdom | Bacteria |
Replicon accession | NC_013757 |
Strand | - |
Start bp | 3136322 |
End bp | 3138022 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | CTP synthase |
Protein accession | YP_003409982 |
Protein GI | 284991428 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.427451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGGTC TCCTCCACAA CTCCCAGCCG ACGAAGTTCG TCTTCGTCAC CGGCGGAGTC GTCTCCTCCC TCGGCAAGGG CCTGACGGCC AGCTCCCTCG GGGCCCTGCT GTCCAGCCGT GGCCTGCGGG TCACGATGCA GAAGCTGGAC CCCTACCTCA ACGTCGACCC CGGCACGATG AACCCGTTCC AGCACGGCGA GGTCTTCGTC ACCGAGGACG GCGCCGAGAC CGACCTGGAC ATCGGGCACT ACGAGCGGTT CCTCGACACC GACCTCACCG GCCGGTCGAA CGTGACCACC GGCCAGGTGT ACTCCGAGGT GATCGCCAAG GAGCGCCGCG GGGAGTACCT CGGCGACACC GTGCAGGTCA TCCCGCACAT CACCAACGAG ATCAAGTCGC GCATCATGGC CGCGGCCGAG GGCCCCGAGC GCGTGGACGT CGTCATCACC GAGGTCGGCG GCACCGTCGG CGACATCGAG TCGCTGCCCT TCCTCGAGGC CGCCCGCCAG GTGCGCCACG AGATCGGCCG GGACAACTGC TTCTTCCTGC ACATCTCGCT GGTGCCCTAC ATCGCGCCGT CCGGTGAGCT GAAGACCAAG CCGACCCAGC ACTCGGTCGC CGCGCTGCGC AACATCGGCA TCCAGCCCGA CGCGCTGGTC TGCCGCTCGG ACCGGGAGAT CGGCACCGGG CTCAAGCGCA AGATCAGCCT GATGTGCGAC GTCGACGCCG AGGGCGTCAT CTCCTGCCCC GACGCGCCGT CGATCTACGA CATCCCCAAG GTGCTGCACC GCGAGGGCCT GGACGCCTAC GTCGTCCGCC GGCTGGGCCT GCCGTTCCGT GACGTCGACT GGACCGTGTG GGGCGACCTG CTCGACCGGG TCCACGCGCC GAAGCAGACG GTGACCATCG CGCTGGTCGG CAAGTACATC GACCTGCCCG ACGCCTACCT GTCGGTGACC GAGGCGCTGC GGGCCGGCGG GTTCGCGCAC CGCAGCCGGG TGCAGATCCG CTGGGTGCCC TCCGACGACT GCCAGACCCC CGAGGGCGCC GAGCGGGCGC TGGCCGGCGT CGACGGCGTC TGCATCCCGG GCGGGTTCGG CGTCCGCGGC ATCGACGGCA AGCTGGGCGC GATCCGGCAC GCGCGGGTCA ACGGCGTCCC GCTGCTGGGC CTGTGCCTGG GCCTGCAGTG CATGGTCATC GAGGCGGCGC GCAACCTGGC CGGGTTGCCG GAGGCCAACT CCGCCGAGTT CGACGCGGAC ACCCCCGACG CGGTCATCGC GACGATGGCC AGCCAGGTCG ACGTCGTCGC CGGGCGGGGC GACATGGGCG GCACGATGCG GCTGGGCAGC TACCCGGCGT CCCTGCAGAA GGGCTCGGTC GTCGCGCAGG CCTACGGCGC ATGCGAGATC ACCGAGCGGC ACCGGCACCG CTACGAGGTG GCCAACGCCT ACCGCGACCG GATCGGCGAG GCCGGGTTGG TCTTCTCCGG CACCTCGCCC GACGGCCTGC TGGTCGAGTT CGCCGAGCTG CCGCGCGAGG TGCACCCGTT CTTCGTCGGC ACCCAGGCGC ACCCGGAGCT CAAGAGCCGC CCGACCCGGC CGCACCCGCT GTTCGCGGCG TTCGTGCAGG CGGCGATCGA CTTCTCCGAG TCCGCCCGGC TGCCGGTGCC GATCGACGAG GCCGAGAAGG TGGGGATCTG A
|
Protein sequence | MRGLLHNSQP TKFVFVTGGV VSSLGKGLTA SSLGALLSSR GLRVTMQKLD PYLNVDPGTM NPFQHGEVFV TEDGAETDLD IGHYERFLDT DLTGRSNVTT GQVYSEVIAK ERRGEYLGDT VQVIPHITNE IKSRIMAAAE GPERVDVVIT EVGGTVGDIE SLPFLEAARQ VRHEIGRDNC FFLHISLVPY IAPSGELKTK PTQHSVAALR NIGIQPDALV CRSDREIGTG LKRKISLMCD VDAEGVISCP DAPSIYDIPK VLHREGLDAY VVRRLGLPFR DVDWTVWGDL LDRVHAPKQT VTIALVGKYI DLPDAYLSVT EALRAGGFAH RSRVQIRWVP SDDCQTPEGA ERALAGVDGV CIPGGFGVRG IDGKLGAIRH ARVNGVPLLG LCLGLQCMVI EAARNLAGLP EANSAEFDAD TPDAVIATMA SQVDVVAGRG DMGGTMRLGS YPASLQKGSV VAQAYGACEI TERHRHRYEV ANAYRDRIGE AGLVFSGTSP DGLLVEFAEL PREVHPFFVG TQAHPELKSR PTRPHPLFAA FVQAAIDFSE SARLPVPIDE AEKVGI
|
| |