Gene Noca_1522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1522 
Symbol 
ID4595673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1616182 
End bp1617318 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content73% 
IMG OID639776120 
Producthypothetical protein 
Protein accessionYP_922723 
Protein GI119715758 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1387] Histidinol phosphatase and related hydrolases of the PHP family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.975012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGTCC ACCGGGTGTG GACGAAAGAA AACTGGGCCA ACCAGCGGGA GAACCGACGC 
GAGAGAGAGG TCAGCGGCGT GCCGGACCAG ACAGCGGGCG ACGAGTACGG CGCCGGGCCG
GTCGCCGCCC TGCGCCGGAT CGCGTTCCTG CTCGAGCGCG GCCGGGAGGA CACCTACAAG
GTCAAGGCGT TCCGCGGTGC CGCTGCCGCG ATCCTGCCGC TGACGGCCGA GCAGGTCGCC
GCGGCGGTCG AGGACGGCAG CCTGACGTCG CTGCCGGGTG TCGGTGCGAG CACGGCCCGG
GTCATCGCCG ATGCCGTGCG CGGGGTGCTG CCGACCCGGC TGGCCGAGCT CGAGCGCGAG
CACGGCGGTG ACCTGGCGAG CGGCGGCCAG GGGCTCCGCG CCGCCCTGCG CGGCGACCTG
CACTCCCACT CCGACTGGTC CGACGGCGGC TCGCCGATCG AGGAGATGGC GTTCACGGCC
ATCGAGCTCG GCCACGACTA CCTGGTGCTC ACCGACCACT CGCCGCGGCT GACCGTCGCG
CACGGCCTCA GCGCCGAGCG GTTGACCCGC CAGCTGGGCG TGGTCGACGC GGTCAACCGG
CACCTCTCCG GGGTCGACGA CTCCTTCACG CTGCTCAAGG GGATCGAGGT CGACATCCTC
GACGACGGCT CGCTGGACCA GGACGACGAC CTGCTGGCGC AGCTCGACGT CCGGGTCGCG
AGCGTGCACT CCAAGCTCAA GATGGAGCCG GCGGACATGA CCCGGCGGAT GATCGGCGCG
ATCCGCAACC CGCGCACCAA CGTCCTCGGT CACTGCACCG GCCGGCTGGT GACCGGCAAC
CGCGGCACCC GCCCGGGCTC CCGTTTCGAC GCGGGTGCGG TGTTCGAGGC CTGCGCGGAG
CACGACGTCG CGGTCGAGAT CAACTCCCGG CCCGAGCGGC GGGACCCGCC GACGGCGCTG
CTGGAGCTGG CCCGGGACGC CGGCTGCGTG TTCTCCATCG ACAGCGACGC CCACGCCCCC
GGGCAGCTGG ACTTCCTGGT CTACGGCTGC GAGCGGGCCG AGGCGGCCGG CATCGACCCG
GACCGGATCG TCAACACCTG GCCGCGGGAG CGGTTGCTGG CCTGGGCGAG GAAGTAG
 
Protein sequence
MAVHRVWTKE NWANQRENRR EREVSGVPDQ TAGDEYGAGP VAALRRIAFL LERGREDTYK 
VKAFRGAAAA ILPLTAEQVA AAVEDGSLTS LPGVGASTAR VIADAVRGVL PTRLAELERE
HGGDLASGGQ GLRAALRGDL HSHSDWSDGG SPIEEMAFTA IELGHDYLVL TDHSPRLTVA
HGLSAERLTR QLGVVDAVNR HLSGVDDSFT LLKGIEVDIL DDGSLDQDDD LLAQLDVRVA
SVHSKLKMEP ADMTRRMIGA IRNPRTNVLG HCTGRLVTGN RGTRPGSRFD AGAVFEACAE
HDVAVEINSR PERRDPPTAL LELARDAGCV FSIDSDAHAP GQLDFLVYGC ERAEAAGIDP
DRIVNTWPRE RLLAWARK