Gene Noca_3400 details


Gene Information

Locus tag: Noca_3400
Symbol:
ID: 4598198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3603372
End bp: 3605057
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 69%
IMG OID: 639778006
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_924587
Protein GI: 119717622
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
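
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3603372, 3605057)
print(length)                     # 1686
print(protein_length_aa(length))  # 561
```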


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.557374
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGATCC ACCCGAGCCA CAGCACCGTC CACCGCACCG GATCGCGCGC CGACATCCGC 
GTGCCCTTCA CCCGGGTCGC GCTCACCAAC GGCGAGACGT TCGACCGGTA CGCCACTGCG
GGACCCGGCA GCGACCCCGA GGTCGGGCTG CCGCCCCTGC GCGCTGACTG GATCGCCCAG
CGCGGCGACA CCGAGGAGTA CGCCGGCCGG GAGACCCAGC TGCTCGACAA CGGCAGGGCC
GCCATCCGAC GGGGCGAGGC ACGCGACCAG TGGCGAGGGA CCAGAAGCCG GCCGCGGCGC
GGTACCAGCA CCGTCACCCA GATGCACTAC GCCCGGCAGG GCGTGGTGAC CCCGGAGATG
GAGTACGTCG CGATCCGCGA GGGCTGCAAC GTCGACCTGG TCCGCTCCGA GGTCGCCGCC
GGGCGCGCGA TCATCCCCGC CAACCTCAAC CACCCCGAGG CCGAGCCGAT GATCATCGGG
CGCCGGTTCC TGGTCAAGGT GAACGCCAAC ATCGGCAATT CCGCGGTCAC CAGCTCGATC
GCCGAGGAGG TCGACAAGCT CACCTGGGCG GTCACCTGGG GTGCCGACAC CGTCATGGAC
CTGTCCACCG GCGACGACAT CCACACCACC CGGGAATGGA TCATCCGCAA CTCACCGGTC
CCGATCGGGA CCGTCCCGAT CTACCAGGCC CTGGAGAAGG TCGACGGCGA CGCCAGCCGG
CTGACCTGGG AGATCTTCCG AGACACCGTC ATCGAGCAGT GCGAACAGGG CGTGGACTAC
ATGACCATCC ATGCCGGGGT GCTGCTGCGC TACGTGCCAC TGACCGCGCA GCGCATCACC
GGGATCGTCT CCCGCGGCGG GTCGATCATG GCCGGCTGGT GCCTGGCGCA CCACCAGGAG
AACTTCCTCT ACACGCACTT CGACGAGCTG TGCGAGATCT TCGCACGCTA CGACGTGTCC
TTCTCGCTCG GCGACGGCCT GCGCCCCGGT TGCACAGCCG ATGCGAACGA CGAGGCGCAG
CTCTCCGAGC TGCGTACCCT GGCCGAGCTC ACCCAGCGTG CCTGGGAGCA CGACGTCCAG
GTGATGGTGG AAGGACCTGG GCACGTGCCG CTCAACCTGG TCGAGGAGAA CGTCGTCCTG
CAGCAGGACT GGTGCCACGG CGCCCCGTTC TACACCCTCG GCCCGCTGGC CACCGACATC
GCACCCGGCT ACGACCACAT CACCTCCGCG ATCGGCGCGG CGGCCATCGC CATGCACGGC
ACCGCCATGC TCTGCTACGT CACCCCCAAG GAGCACCTCG GACTGCCGAA CCGCGACGAC
GTCAAGACCG GCGTGATCAC CTACAAGCTC TCCGCGCACG CCGCCGACGT CGCCAAGGGC
CACCCCGGAG CCCGCGACTG GGACGACGCC TTGTCCAAGG CCCGCTTCGA GTTCCGCTGG
CACGACCAGT TCGCCCTCTC CCTCGACCCG CACACCGCCG AGTCCTTCCA CGACGAGACG
CTCCCGGCCG AGGCCAGCAA GACGGCGCAC TTCTGCTCCA TGTGCGGCCC GAAGTTCTGC
TCGATGCGCA TCAGCCAGGA CGTACGCGAC TACGTCACCT CAGGCATGGC CGAGAAGTCG
GCGCAGTTCC TGGAGCTGGG CTCCTCGGTC TACGTCGAGG GTGATACCTA CACCGCGGCA
CCGTGA
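
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence in this layout. The following is a minimal sketch, assuming the sequence is pasted as text with the blocks of ten bases separated by spaces and newlines. The helper name `gc_percent` is hypothetical, and how the database rounds its reported value is not stated in this record.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Small illustrative input (not taken from the gene): 6 of 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # 75.0
```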
 
Protein sequence
MQIHPSHSTV HRTGSRADIR VPFTRVALTN GETFDRYATA GPGSDPEVGL PPLRADWIAQ 
RGDTEEYAGR ETQLLDNGRA AIRRGEARDQ WRGTRSRPRR GTSTVTQMHY ARQGVVTPEM
EYVAIREGCN VDLVRSEVAA GRAIIPANLN HPEAEPMIIG RRFLVKVNAN IGNSAVTSSI
AEEVDKLTWA VTWGADTVMD LSTGDDIHTT REWIIRNSPV PIGTVPIYQA LEKVDGDASR
LTWEIFRDTV IEQCEQGVDY MTIHAGVLLR YVPLTAQRIT GIVSRGGSIM AGWCLAHHQE
NFLYTHFDEL CEIFARYDVS FSLGDGLRPG CTADANDEAQ LSELRTLAEL TQRAWEHDVQ
VMVEGPGHVP LNLVEENVVL QQDWCHGAPF YTLGPLATDI APGYDHITSA IGAAAIAMHG
TAMLCYVTPK EHLGLPNRDD VKTGVITYKL SAHAADVAKG HPGARDWDDA LSKARFEFRW
HDQFALSLDP HTAESFHDET LPAEASKTAH FCSMCGPKFC SMRISQDVRD YVTSGMAEKS
AQFLELGSSV YVEGDTYTAA P
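
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as starts. Because this gene begins with ATG, no special start-codon handling is included. The function name `translate` is illustrative.

```python
# Codon table built from the NCBI-style base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace is ignored; ambiguous bases such as N raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGCAGATCC ACCCGAGCCA CAGCACCGTC"))  # MQIHPSHSTV
# Last 6 bases: the final residue followed by the TGA stop codon.
print(translate("CCGTGA"))  # P
```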