Gene Noca_4023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4023 
Symbol 
ID4596537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4248915 
End bp4250480 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content74% 
IMG OID639778629 
Productoxidoreductase, molybdopterin binding 
Protein accessionYP_925207 
Protein GI119718242 
COG category[R] General function prediction only 
COG ID[COG2041] Sulfite oxidase and related enzymes 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.351273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACCT CCCGCGCGTG GTCGGTCGCC GGCCTGGTCG CCGGGTTCGC CGGTCTCGCC 
GTGAGCTACA GCCTCGCGAT GGTGATGACG ATCCGCGACT CGCCGTTGCT CGCCGTCGCC
GACCTGGTCG TCCGGCTCAC CCCCGGGCCG GTCGTCGAGC GCGCGATCCG GTTCTTCGGC
CACCACGACA AGACGGTGCT GCTGGTCGTG CTGCTGCTGA TCAGCGCGGC GGTGTTCGCC
TGGGCCGGCC GGCTGGCGCG GCACTCGTGG TGGCGCTCCA CCCTGGTGTT CGCCGGGCTT
GCCGTCATCG GTGGCGTCGC GGTGTGGCTC CAGCGCGGCG CCACGGCGCT CGACGGGTTG
CCGGTCGCCG CCGGGTTCGC GACCTGGGTG GTCTGCCTCT CGCTGCTGAC CGAGCCGCTG
CGGCGGCACA CACGGGCCCT CGCCGCGGCG CCGGAGGACG TCACGGACCC GGCGCACCCC
GACCACACCC GCCGTACCTT CCTGCTGCGG GTCGGCGCCA TCGCGGCGGT GGGCGTCGTG
GCGGCCGCGG CCGGACGGGT GGTCGGCCGC GGCCGGCGGC ACGTCGAGGC GAGCCGCCGG
CTGCTGCGGC TGCCCGGCGT GACCCGGCCG CTGGTGCCGA AGGGAACGCG GGTCGAGGTC
CCCGACATGA CGCCGTGGCT GACGCCGAAC GACGCCTTCT ACCGGATCGA CACCGCTCTC
GTGGTGCCGG CGATCGAGCC CTCGGACTGG CTGCTGCGGA TCCACGGGAT GGTCGACCGG
GAGATCGTGC TGACCTACCG GGACCTGATC GACCGGCAGC TCACCGAGGC CTGGGTGACG
CTCAACTGCG TCTCCAACGA GGTCGGCGGC GACCTGATCG GCAACGCCTG GTGGAGCGGC
GTCCGGATCG CCGGGCTGCT GAGCGCCGCG GGGGTGCATG CCGGTGCGGA CGCCGTCTTG
CAGACGTCCG AGGACGGCTG GACCTGCGGC ACCCCGCTCG ACGTGCTCAC CGACGACCGG
GACGCGATGC TGGCGGTGGC GATGAACGGC CGGCCGCTGC CGATCGACCA CGGGTTCCCG
GTGCGCACGA TCGTGCCCGG GCTGTACGGC TACGTGTCGG CGTGCAAGTG GGTCGTGGAC
ATGTTGGTGA CCCGGTTCGC CGACATCGAG GCCTACTGGA CCCAGAAGGG CTGGTCCGAG
CTCGGGCCGG TGAAGATCGC CTCGCGCATC GACGTCCCCC GCGACGGCGG CGAGATCGAC
TCCGGCGGAG CCCGGGTGGC CGGCGTCGCC TGGGCGCAGC ACACCGGCAT CTCGCGGGTG
GAGGTGTCGG TCGACGGCGG GAGCTGGCTC CCGGCCCGGC TGGCGCGGGT GCCGAGCAAC
GACACCTGGG TGCAGTGGGT CGCGGACCTC GACGTACCCC CAGGCGAGCA CACGCTGACC
GTGCGGGCCA CCGATGCCGT GGGCCTGCTG CAGACGGGCG TCGAGCAGGA TGTGCTTCCG
GACGGTGCCA CCGGCTGGCA CACGATCGAC CTCACCGCCC GCGAGCCCCA GCAGGAGGAC
GGCTGA
 
Protein sequence
MSTSRAWSVA GLVAGFAGLA VSYSLAMVMT IRDSPLLAVA DLVVRLTPGP VVERAIRFFG 
HHDKTVLLVV LLLISAAVFA WAGRLARHSW WRSTLVFAGL AVIGGVAVWL QRGATALDGL
PVAAGFATWV VCLSLLTEPL RRHTRALAAA PEDVTDPAHP DHTRRTFLLR VGAIAAVGVV
AAAAGRVVGR GRRHVEASRR LLRLPGVTRP LVPKGTRVEV PDMTPWLTPN DAFYRIDTAL
VVPAIEPSDW LLRIHGMVDR EIVLTYRDLI DRQLTEAWVT LNCVSNEVGG DLIGNAWWSG
VRIAGLLSAA GVHAGADAVL QTSEDGWTCG TPLDVLTDDR DAMLAVAMNG RPLPIDHGFP
VRTIVPGLYG YVSACKWVVD MLVTRFADIE AYWTQKGWSE LGPVKIASRI DVPRDGGEID
SGGARVAGVA WAQHTGISRV EVSVDGGSWL PARLARVPSN DTWVQWVADL DVPPGEHTLT
VRATDAVGLL QTGVEQDVLP DGATGWHTID LTAREPQQED G