Gene Noca_2026 details

Gene Information

Locus tag: Noca_2026
Symbol:
ID: 4598648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 2171263
End bp: 2172630
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 71%
IMG OID: 639776630
Product: hypothetical protein
Protein accession: YP_923223
Protein GI: 119716258
COG category:
COG ID:
TIGRFAM ID:
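
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2171263
end_bp = 2172630

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not add a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with "Gene Length"
print(protein_length, "aa")  # compare with "Protein Length"
```

Under these assumptions the results agree with the listed 1368 bp and 455 aa.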


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACAG CACGCGAGTT GGCCGACCTG TCGGAGGAGC GGATCCTCGA CCTGGCCGGC 
GCGTGCTCGG AGACCATCCG CGACGCCGAG ACCGAGCTGC TCCGGCTGGC GTATCAGTGG
GCGATCGTGC ACCCGGCGAA CCGGCTCGAC CCGGTCGAGG CCGACCAGCC TGGTCGCGAA
CGCGCTCGCC AGCTCGGCGG TGAGGGCACC CCGCGGGTCG CCGAGTTCGC GGCGGCGGAG
TTCGGGGCCC GGATCGGCCG CTCGCCGTAT GCGGCGGCGT CTCTGATCGG GGACGCGCTG
GACTTGGAGC ACCGGTTCCC GCGCCTGTGG GCGCGGGTCG AGGCCGGTGA GGTGCGCGCC
TCCTATGCCC GCTACGTCAC CACCAAGACC CGGACCCTCA CCGCCGAGCA GGCGGCCTAC
GTCGATGCTC GCGTCTTCGA GTCCGCGGAC GGGCGGCTGC CCTGGTCCCG GTTCGAGGAG
CTGGTGGCCG GCACGGTCGC CCAGGCCGCC CCCGAGGCGG CCCGGGAGAA GGAGGAGCGC
GCCGCCAAGG CGAGGTTCGC CAAGAAGGTC CGCCGCACCG TCGCCGACGA GACCCACGGG
ATGGCCTCGT TCCTGGTGCA CGCCGACCTG CCCACCATCG AGGCCATCGA CGACTACGTC
ACCCAACGAG CCAAGCAGCT CGCCGACACC CTGCCCGACG CCCCCCACCT GGCCACCGAG
GATGACCGGC GGGTGCACGC GTTCCTGCTG CTGGTCTCCG GCGCGCCGGC CGACACCGAC
CTGGCGGATC TGTTGCCGCA GGTGTGCCTG TACGTGCACA CCTACGCCGA CCCCGGCGCC
GACCGCACCC AGAGTTCCGA GGGGATCGTC CGGGTCGAGG GCCATGGTCC GGTCACCCAG
GAGTGGGTCC GCCGGTTCCT CGGCCCGCAC GCCCGGTTCA CGATCCGTCC GGTCCTCGAC
CTCGCCGGCC AAGCCCCGGT GGATTCCTGG GAGATCCCCG ACCGACATAG GCGGGCCGTG
CATCTGATGA CGCCGGCCGA CACCTTCCCC TTCGCCTCCT GCACCTCACC GGGCATGCAG
GTCGACCACA CCATCCCCTA TCACCAGGGT GGTGTCAGCG GGGTGGGCAA CTACGGGCCG
ATGACCACCC TGCACCACCG GATCAAGACG CATGGCGCGG GTTGGCAGGT CAAGCAGCCG
TTCCCCGGCA TCTATATGTG GCGTGACCCC CACGGCGGCT TCTACCTCGT CGACCACACC
GGCACCCGCC GACTCCCCGG AACCCGACGC CCCCTGGTCG TCGAGCTCTG GCACCCACCC
GCCGGCATCG AGATCGCTCT TGCCGACGAC TACACGCCCG CCGCCTAA
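
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any analysis. The following is a minimal sketch, using only the first row of the sequence as sample input, of how the blocks could be joined and the GC fraction computed. The helper names `join_blocks` and `gc_fraction` are hypothetical; running the same function on the full sequence is how a value comparable to the listed GC content would be obtained.

```python
def join_blocks(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First row of the gene sequence, copied as printed (10-base blocks).
first_row = "ATGAGCACAG CACGCGAGTT GGCCGACCTG TCGGAGGAGC GGATCCTCGA CCTGGCCGGC"

dna = join_blocks(first_row)
print(len(dna), "bases")
print(f"GC fraction of this row: {gc_fraction(dna):.3f}")
```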
 
Protein sequence
MSTARELADL SEERILDLAG ACSETIRDAE TELLRLAYQW AIVHPANRLD PVEADQPGRE 
RARQLGGEGT PRVAEFAAAE FGARIGRSPY AAASLIGDAL DLEHRFPRLW ARVEAGEVRA
SYARYVTTKT RTLTAEQAAY VDARVFESAD GRLPWSRFEE LVAGTVAQAA PEAAREKEER
AAKARFAKKV RRTVADETHG MASFLVHADL PTIEAIDDYV TQRAKQLADT LPDAPHLATE
DDRRVHAFLL LVSGAPADTD LADLLPQVCL YVHTYADPGA DRTQSSEGIV RVEGHGPVTQ
EWVRRFLGPH ARFTIRPVLD LAGQAPVDSW EIPDRHRRAV HLMTPADTFP FASCTSPGMQ
VDHTIPYHQG GVSGVGNYGP MTTLHHRIKT HGAGWQVKQP FPGIYMWRDP HGGFYLVDHT
GTRRLPGTRR PLVVELWHPP AGIEIALADD YTPAA
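
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative only: it builds the standard codon table, whose amino acid assignments are shared by translation table 11 (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical, and only the first row of each sequence is used as sample input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' is stop).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence and the matching start of the protein.
gene_start = "".join(
    "ATGAGCACAG CACGCGAGTT GGCCGACCTG TCGGAGGAGC GGATCCTCGA CCTGGCCGGC".split()
)
protein_start = "".join("MSTARELADL SEERILDLAG".split())

print(translate(gene_start) == protein_start)
```

Passing the full gene sequence, joined into one string, to `translate` would be the way to compare against the complete 455-residue protein.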