Gene Noca_4048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4048 
Symbol 
ID4596562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4271226 
End bp4272896 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content74% 
IMG OID639778654 
Producthypothetical protein 
Protein accessionYP_925232 
Protein GI119718267 
COG category[R] General function prediction only 
COG ID[COG0661] Predicted unusual protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCCT CCTCGCACCC ACCCCTGTCG CGCTACGCGG CCCTGGCGCG GCTGCTGGTC 
CGGCACCGTC GCAGCGACCT CCTGTCCGGA CTGGGCCTGG ACGGGTTCCC GCGCGACGAC
GACGCCCCCG AGGGCGACGA GGCCCTCGCG GAGCGGTTCG CGGCCGACCT CGAGGAGCTC
GGGCCGACGT TCATCAAGCT CGGCCAGCTG CTCTCGACCC GGTTCGACCT GCTGCCCCCG
GCGTACACGA CCGCGCTCGC CCGGCTCCAG GACGACGCCG AGCCGGCGGA CTTCGACAGC
CTCCGCGAGG TCGTGGAGGC CGAGCTCGGC GGCCGGATCG GCGACCTGTA CGGGTCCTTC
GATCCCGAGC CGCTGGCGGC CGCATCGCTC GGCCAGGTGC ACCGCGCGAC GCTGCGCAAC
GGTCGTGAGG TCGTCGTCAA GGTGCAGCGT CCCGGCGTCC GGGAGCAGGC GCGGGAGGAC
ATGGAGACGC TCGCGCGGCT CGCGGGCCTG GCCGACCGAC ACACCGACAC CGGCCGCCGG
TTCGGCTTCG AGCAGCTGCT CGCCGAGTTC CGCCGCTCGC TGAGCGGTGA GCTCGACTAC
CGCCGCGAGG CGCGCAACCT GCGCCGGTTC CGCGAGCTCA CGTCCGACTA CGACCTCCTG
GTGGTGCCGG CTCCGGTCGA GGAGCTGTCG ACGTCGCGGG TGCTGACGAT GGACCGGGTC
GACGGGCGCA AGGTCACCGA CGTCGGACCG CTGGGCCTGC TCGACCTCGA CACCCGGCCG
ATGGTGGAGC AGCTGTTCGG CTGCTACCTG GACGCGATGC TGCGCCACGG CTTCCTGCAC
GCCGACCCGC ACCCGGGCAA CATGCTGGTC ACCGACGACG ACCGGCTGGC GCTGCTCGAC
CTCGGGATGG TCACGACCGT CCCGCCGCGG CTGCGCGACC AGGTCACCAA GCTGCTGCTC
GCGCTGGGCG ACGGGGACGG CGACGAGGCG GCCGCGGTGC TGGCCGCCCT CGGCCACCCG
CTGCAGGACT TCGACGCGGC CGCGTTCCGT GCGGACGTCG GGCAGCTCGT CTCGGAGGCG
ACCAGCGCCG GGTCCGACGT CCAGGCCGGC TCGGCCCTGG TCCAGCTGAG CCAGGTCTCC
GGGCGGCACG GGCTGCGCCC GCCCGCCGAG ATGTCGATGG TCGGCAAGGC GCTGCTGAAC
CTGGACCAGG TCACGCTGCA CCTCGACCCG ACGTTCGACC CGGCGGCCGC GGTGCGCGAC
AGCCTGGTGT CGCTGCTGCG CAGCGGCCTG AGCTTCAGCG CCGCCGGGAT GATGAGCGCG
GCGCTGGAGG CCAAGGAGTT CACCGCCAAC CTGCCGCGGC GCGGCAACCG GATCCTGGAG
GCGCTCTCCG AGGGCGAGCT CAGCATCCGG GTGCACGCGG TCGACGAGGA GCGGCTGCAC
ACGGTGCTGC ACCGGGTCGC GAACCGGCTC ACGTTCGGGA TCGTGATCGC GGCGACCGTG
ATCGGGGCGG CGATGATGAT GCGGGTGCCG ACCGAGCACG AGGTGCTGGG CTACCCGGCG
GTGGCGATGG CCTTCTTCGT GTTCGCGGTG CTCAGCGGCG GCGGGCTGAT CGCGTGGGTG
CTGCTCACCG ACCGGCGGGT GGCCCGGGAG CGGCGTCAGG ACCCGGGCTG A
 
Protein sequence
MSSSSHPPLS RYAALARLLV RHRRSDLLSG LGLDGFPRDD DAPEGDEALA ERFAADLEEL 
GPTFIKLGQL LSTRFDLLPP AYTTALARLQ DDAEPADFDS LREVVEAELG GRIGDLYGSF
DPEPLAAASL GQVHRATLRN GREVVVKVQR PGVREQARED METLARLAGL ADRHTDTGRR
FGFEQLLAEF RRSLSGELDY RREARNLRRF RELTSDYDLL VVPAPVEELS TSRVLTMDRV
DGRKVTDVGP LGLLDLDTRP MVEQLFGCYL DAMLRHGFLH ADPHPGNMLV TDDDRLALLD
LGMVTTVPPR LRDQVTKLLL ALGDGDGDEA AAVLAALGHP LQDFDAAAFR ADVGQLVSEA
TSAGSDVQAG SALVQLSQVS GRHGLRPPAE MSMVGKALLN LDQVTLHLDP TFDPAAAVRD
SLVSLLRSGL SFSAAGMMSA ALEAKEFTAN LPRRGNRILE ALSEGELSIR VHAVDEERLH
TVLHRVANRL TFGIVIAATV IGAAMMMRVP TEHEVLGYPA VAMAFFVFAV LSGGGLIAWV
LLTDRRVARE RRQDPG