Gene Noca_0613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_0613 
Symbol 
ID4596030 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp650063 
End bp651214 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content65% 
IMG OID639775220 
Productprotein of unknown function DUF1100, hydrolase family protein 
Protein accessionYP_921834 
Protein GI119714869 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCGGA TTCGCGGAAC ATGTAGACGG CCGACGGCTG CGCAGTCGGG AAGATGCTGC 
CTCCCGACCT ATGGAGAAGA AATGACGACC GAGACCCCAC AAGTCCCGAC GGCCGAAGAA
GAGATGCTGA ACTGGGGCCG CCTGTTGATG GACGGTCTGC CGTTTGCCGA CCTCCTGGCC
GCACAGCGTC GGGACGCTGA GACGTCATGG TTCGACTTCT GGATGGCGAA GGCGGACGCG
TACGAGGCCA TGGGTGAGAC CGCCCTGAAG GACGGCAACG ACCTCACGGC CGGCTACTGG
TTCTGGGTCG GATCGATGGC CGCGCAGTAC GCGCAGTTCC TGTGGTTCGA CGAACGCCGC
CCGAAGGGCC AGCTCCGCAA GGCAGGGCTC TATCACCGCG CCGCTCCACT GCTCGACCCC
CCGGCTGAGC GTGTGGACCT CCCTATCGAC GACACCGTGA TCATCGGCTA CCTGCGCCGG
CCCAAGGGGG CCGACGGGCC CGTCCCGTGC GCAGTGCTGC TCGGCGGCCT CGAGAGCACG
AAGGAGGAGA GCTACATGTT CGAGAACCTC CTACTCGAGC GAGGCGTCGC CACCTTCACG
TTCGACGGTC CCGGCCAGGG TGAGATGTTC GAGGACGTCG CACTCGCCGG CGACTACCAC
CGCTACACCT CACGGGTGGT CGACTACCTC GAGACCCTAG GGGTCACTAT CGACGAGGAT
CGGATCGGAG TGCTGGGCCG CAGCCTGGGA GGACATTATG CGCTGCGTGC CGCCTCAATG
GACGACCGCT TCCGTGCATG CGTCACCTGG GGTGGTTTCG TCCAGATGGA CGACTGGGAT
TTCGAGTACC CCATGACCAA GCTCAGCTGG CAGTACGTAA CGAAGTCGCC CGATCTGCCC
ACTGCGCAGG AGCGGGTGAA GGAAGCCATC GACGTACGCC CCGTGCTGGC TGGACTGAAG
GTACCCACGT ACGTGATGCA CGGGGCCAAG GACGAGACTC CTCTCACCGA GCTGGATCTG
TTGCAGGAGT ACGCGGTCAA CGCGGACATC ACGATTGATC TCGAGCCCGA GGGCGATCAC
TGCTGCCACA ACCTCGGTCC CGCCCCGCGC CTTCGCATGG CGGACTGGCT GGCGAACCAG
CTTCGCCCCT GA
 
Protein sequence
MPRIRGTCRR PTAAQSGRCC LPTYGEEMTT ETPQVPTAEE EMLNWGRLLM DGLPFADLLA 
AQRRDAETSW FDFWMAKADA YEAMGETALK DGNDLTAGYW FWVGSMAAQY AQFLWFDERR
PKGQLRKAGL YHRAAPLLDP PAERVDLPID DTVIIGYLRR PKGADGPVPC AVLLGGLEST
KEESYMFENL LLERGVATFT FDGPGQGEMF EDVALAGDYH RYTSRVVDYL ETLGVTIDED
RIGVLGRSLG GHYALRAASM DDRFRACVTW GGFVQMDDWD FEYPMTKLSW QYVTKSPDLP
TAQERVKEAI DVRPVLAGLK VPTYVMHGAK DETPLTELDL LQEYAVNADI TIDLEPEGDH
CCHNLGPAPR LRMADWLANQ LRP