Gene Noca_4474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4474 
Symbol 
ID4596993 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4729658 
End bp4730905 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content70% 
IMG OID639779085 
Productpeptidase M24 
Protein accessionYP_925658 
Protein GI119718693 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.512604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCCC CGAACCGACC GCTCCCGGCG CCCGGGCACA TGGCGGTCGA CTACGAGGAG 
CGGGTCGACT TCGACCGGCT GCGCCACTAC CGGCTCGGCC GGGCCCAGGC CGCGCTGGAG
GGCAGCGAGT GCGGGGCCTT CCTGCTCTTC GACTTCTACA ACATCCGCTA CACCACGCAC
ACCTGGATCG GCGGGGCGCT CGGCGACAAG ATGATCCGCT ACGCGCTGGT CGCGCGCGGC
AAAGAGCCGG TGCTCTGGGA CTTCGGGTCC GCGGTCAAGC ACCACAAGAT CTACTCCCAG
TGGGTGCCCG AGGAGAACTA CCGGGCCGGG TTCCTCGGCT TCCGCGGCGC GGTCGCCCCG
AGCGTCGGGC TGATGGAGAC CGCGGTCGCG GAGATCAAGT CGCTGCTGGT CGAGGCCGGC
GTCGCCGACC TCCCGGTCGG CGTGGACATC GTGGAGCCGC CGTTCCTCTT CGAGATGCAG
CGTCAGGGCC TGACCGTCGT CGACGCCCAG CAGCTGATGC TCGACGCACG CTGCATCAAG
TCCCACGACG AGATCGTGCT GCTCAACCAG GCCGCCGCGA TGGTCGACGG CGTCTACCAG
GACATCGTCG AGGCGCTCAA GCCCGGCGTG CGCGAGAACG AGATCGTCGC GCTCGCCAAC
AAGCGGCTCT ACGAGATGGG CTCGGACCAG GTCGAGGCCG TCAACGCGAT CTCCGGCGAA
CGCTGCAACC CGCACCCGCA CAACTTCACC GACCGGCTGA TCCGCCCCGG CGACCAGGCG
TTCTTCGACA TCATCCACTC CTTCAACGGC TACCGGACCT GCTACTACCG CACGTTCTCG
GTCGGCAGCG CGACCCCGGC CCAGCGCGAC GCCTACACCC AGGCGCGGGA GTGGATGGAC
CGCGGCATCG ACGGCATCCG CGCCGGCGTC GGCACCGACG AGGTGGCCGC GCTGCTGCCC
GAGGCCGAGG AGTTCGGCTT CGGCTCCGAG ATGGAGGCCT TCGGCCTCCA GTTCGCCCAC
GGGCTCGGCC TCGGCCTGCA CGAGCGGCCG ATCATCTCCC GGCTCAACTC GATGAAGGAG
CCGGTCGAGC TCCAGGTCGG CATGGTCTTC GCGCTGGAGA CCTACTGCCC GGCCTCCGAC
GGCGTCTCCG CGGCCCGGAT CGAGGAGGAG ATCGTGATCA CCGAGGACGG CCCCCGGGTG
CTCACCCTCT TCCCGGCGCA GGACCTGGTC GTCGCCAACC CCTACTAG
 
Protein sequence
MSAPNRPLPA PGHMAVDYEE RVDFDRLRHY RLGRAQAALE GSECGAFLLF DFYNIRYTTH 
TWIGGALGDK MIRYALVARG KEPVLWDFGS AVKHHKIYSQ WVPEENYRAG FLGFRGAVAP
SVGLMETAVA EIKSLLVEAG VADLPVGVDI VEPPFLFEMQ RQGLTVVDAQ QLMLDARCIK
SHDEIVLLNQ AAAMVDGVYQ DIVEALKPGV RENEIVALAN KRLYEMGSDQ VEAVNAISGE
RCNPHPHNFT DRLIRPGDQA FFDIIHSFNG YRTCYYRTFS VGSATPAQRD AYTQAREWMD
RGIDGIRAGV GTDEVAALLP EAEEFGFGSE MEAFGLQFAH GLGLGLHERP IISRLNSMKE
PVELQVGMVF ALETYCPASD GVSAARIEEE IVITEDGPRV LTLFPAQDLV VANPY