Gene Noca_4246 details


Gene Information

Locus tag: Noca_4246
Symbol:
ID: 4596760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4483859
End bp: 4484809
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 79%
IMG OID: 639778852
Product: dipeptidyl aminopeptidases/acylaminoacyl-peptidases-like
Protein accession: YP_925430
Protein GI: 119718465
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
TIGRFAM ID:
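
The gene and protein lengths listed above follow from the Start bp and End bp fields. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a CDS that ends in a stop codon (the function names are hypothetical, chosen for illustration):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(4483859, 4484809)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 951 316
```

Both values agree with the Gene Length and Protein Length fields of this record.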


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.760723
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGGGGC GGATCAAGGC CCGCTACGGT CGGCCGGCGG TACGCCGGGG GCTCGCGGCA 
CCGGTCGGGC TGGTCGCCAC GGACGTGAGC CTGCTCGGCC TCGACGGCGG GACGCTGCGG
GGCTGGTTCT GCCGCCCCGA CGGCGGCCTC GACGGCGGCC CCGACGCCCC CGACGGCGGA
CCCGACTCCC CCGGCGACGG ACCCGACGAC GGTCTCGACG ACGGACCCGA CGACGGACCC
GACGACGCCC CCGACGCCCC TGACGACGGC CCGGACGGCC GCCCCGGTGC CCCCGGCGCG
GTGGTGCTGC ACGGCTGGGG CGGCGCGGCG GCGGACATGG CCCCGGTCGC CCAGCCGTTG
ATCGAGGCTG GCGTCCACGC CCTGCTCCTG GACGCCCGGT GTCACGGCCG CAGCGACGAC
GCGGAGTTCA CCTCGATGCC GAGCTTCGCC GCGGACCTGG CGGCCGGCGT CCGGTGGCTG
CGCGAGCAGC CCGGCATCGA CCCCGACCGG GTCCTGCTCG TGGGCCACTC CGTGGGCGCC
GGGGCGTGCC TGCTGGCGGC CCGCGAGGAC CCGCGCATCG CCGCGGTGAT CAGCCTCTCC
TCGATGGCGG ACCCGCGGGA GGTGATGGCC CGGCTGCTGA CCGGCGGCGG CGTCCCGCGC
CCGCTCGTCC CGGTCTCGCT GCGGGTGGTG GAGCACGTCA TCGGGGCCCG GTTCGCCGAC
TTCGCGCCGC TCGCCACGGT GGCGGCCCTC GACGTCCCGG TCCTGCTGGC GCACGGGGTC
CGGGACGCCG TGGTCCCGGT CGCGGACGTC CACCGGTTGG CCGCCGTCGC GCGCGACGCC
ACCGTGCTGG AGCTGCCCGA TGCCGGGCAC GCGGAGCCGG TCGACACCAC GGTGCTGGCC
GACGCGCTGC GCGCGTTCGC GCGCCGCACC GTCGCCGGCC ACCCGGGCTA G
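
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any calculation. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and the short input is only the first three blocks of the sequence above, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an unblocked nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First three blocks of the gene sequence, used here as a small example.
first_blocks = "GTGCTGGGGC GGATCAAGGC CCGCTACGGT"
seq = clean_sequence(first_blocks)
print(len(seq), seq[:3], gc_percent(seq))
```

Running the same two functions on the full sequence would give a value to compare against the GC content field.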
 
Protein sequence
MLGRIKARYG RPAVRRGLAA PVGLVATDVS LLGLDGGTLR GWFCRPDGGL DGGPDAPDGG 
PDSPGDGPDD GLDDGPDDGP DDAPDAPDDG PDGRPGAPGA VVLHGWGGAA ADMAPVAQPL
IEAGVHALLL DARCHGRSDD AEFTSMPSFA ADLAAGVRWL REQPGIDPDR VLLVGHSVGA
GACLLAARED PRIAAVISLS SMADPREVMA RLLTGGGVPR PLVPVSLRVV EHVIGARFAD
FAPLATVAAL DVPVLLAHGV RDAVVPVADV HRLAAVARDA TVLELPDAGH AEPVDTTVLA
DALRAFARRT VAGHPG
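
The protein begins with M although the gene begins with GTG, because an alternative start codon is read as methionine at the initiation position. The following is a minimal sketch of translating a CDS under translation table 11, not the database's own pipeline. It assumes the codon assignments of table 11 match the standard code, and it treats only ATG, GTG and TTG as start codons (an illustrative subset); `translate_cds` is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: a subset of the start codons used in bacteria.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        else:
            aa = CODON_TABLE[codon]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGCTGGGGC GGATCAAGGC CCGCTACGGT"))  # prints: MLGRIKARYG
```

The output matches the first ten residues of the protein sequence above.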