Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3400 |
Symbol | |
ID | 4598198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 3603372 |
End bp | 3605057 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639778006 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_924587 |
Protein GI | 119717622 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.557374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATCC ACCCGAGCCA CAGCACCGTC CACCGCACCG GATCGCGCGC CGACATCCGC GTGCCCTTCA CCCGGGTCGC GCTCACCAAC GGCGAGACGT TCGACCGGTA CGCCACTGCG GGACCCGGCA GCGACCCCGA GGTCGGGCTG CCGCCCCTGC GCGCTGACTG GATCGCCCAG CGCGGCGACA CCGAGGAGTA CGCCGGCCGG GAGACCCAGC TGCTCGACAA CGGCAGGGCC GCCATCCGAC GGGGCGAGGC ACGCGACCAG TGGCGAGGGA CCAGAAGCCG GCCGCGGCGC GGTACCAGCA CCGTCACCCA GATGCACTAC GCCCGGCAGG GCGTGGTGAC CCCGGAGATG GAGTACGTCG CGATCCGCGA GGGCTGCAAC GTCGACCTGG TCCGCTCCGA GGTCGCCGCC GGGCGCGCGA TCATCCCCGC CAACCTCAAC CACCCCGAGG CCGAGCCGAT GATCATCGGG CGCCGGTTCC TGGTCAAGGT GAACGCCAAC ATCGGCAATT CCGCGGTCAC CAGCTCGATC GCCGAGGAGG TCGACAAGCT CACCTGGGCG GTCACCTGGG GTGCCGACAC CGTCATGGAC CTGTCCACCG GCGACGACAT CCACACCACC CGGGAATGGA TCATCCGCAA CTCACCGGTC CCGATCGGGA CCGTCCCGAT CTACCAGGCC CTGGAGAAGG TCGACGGCGA CGCCAGCCGG CTGACCTGGG AGATCTTCCG AGACACCGTC ATCGAGCAGT GCGAACAGGG CGTGGACTAC ATGACCATCC ATGCCGGGGT GCTGCTGCGC TACGTGCCAC TGACCGCGCA GCGCATCACC GGGATCGTCT CCCGCGGCGG GTCGATCATG GCCGGCTGGT GCCTGGCGCA CCACCAGGAG AACTTCCTCT ACACGCACTT CGACGAGCTG TGCGAGATCT TCGCACGCTA CGACGTGTCC TTCTCGCTCG GCGACGGCCT GCGCCCCGGT TGCACAGCCG ATGCGAACGA CGAGGCGCAG CTCTCCGAGC TGCGTACCCT GGCCGAGCTC ACCCAGCGTG CCTGGGAGCA CGACGTCCAG GTGATGGTGG AAGGACCTGG GCACGTGCCG CTCAACCTGG TCGAGGAGAA CGTCGTCCTG CAGCAGGACT GGTGCCACGG CGCCCCGTTC TACACCCTCG GCCCGCTGGC CACCGACATC GCACCCGGCT ACGACCACAT CACCTCCGCG ATCGGCGCGG CGGCCATCGC CATGCACGGC ACCGCCATGC TCTGCTACGT CACCCCCAAG GAGCACCTCG GACTGCCGAA CCGCGACGAC GTCAAGACCG GCGTGATCAC CTACAAGCTC TCCGCGCACG CCGCCGACGT CGCCAAGGGC CACCCCGGAG CCCGCGACTG GGACGACGCC TTGTCCAAGG CCCGCTTCGA GTTCCGCTGG CACGACCAGT TCGCCCTCTC CCTCGACCCG CACACCGCCG AGTCCTTCCA CGACGAGACG CTCCCGGCCG AGGCCAGCAA GACGGCGCAC TTCTGCTCCA TGTGCGGCCC GAAGTTCTGC TCGATGCGCA TCAGCCAGGA CGTACGCGAC TACGTCACCT CAGGCATGGC CGAGAAGTCG GCGCAGTTCC TGGAGCTGGG CTCCTCGGTC TACGTCGAGG GTGATACCTA CACCGCGGCA CCGTGA
|
Protein sequence | MQIHPSHSTV HRTGSRADIR VPFTRVALTN GETFDRYATA GPGSDPEVGL PPLRADWIAQ RGDTEEYAGR ETQLLDNGRA AIRRGEARDQ WRGTRSRPRR GTSTVTQMHY ARQGVVTPEM EYVAIREGCN VDLVRSEVAA GRAIIPANLN HPEAEPMIIG RRFLVKVNAN IGNSAVTSSI AEEVDKLTWA VTWGADTVMD LSTGDDIHTT REWIIRNSPV PIGTVPIYQA LEKVDGDASR LTWEIFRDTV IEQCEQGVDY MTIHAGVLLR YVPLTAQRIT GIVSRGGSIM AGWCLAHHQE NFLYTHFDEL CEIFARYDVS FSLGDGLRPG CTADANDEAQ LSELRTLAEL TQRAWEHDVQ VMVEGPGHVP LNLVEENVVL QQDWCHGAPF YTLGPLATDI APGYDHITSA IGAAAIAMHG TAMLCYVTPK EHLGLPNRDD VKTGVITYKL SAHAADVAKG HPGARDWDDA LSKARFEFRW HDQFALSLDP HTAESFHDET LPAEASKTAH FCSMCGPKFC SMRISQDVRD YVTSGMAEKS AQFLELGSSV YVEGDTYTAA P
|
| |