Gene Noca_4378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4378 
Symbol 
ID4596896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4628107 
End bp4629318 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content76% 
IMG OID639778988 
Productcupin 4 family protein 
Protein accessionYP_925562 
Protein GI119718597 
COG category[S] Function unknown 
COG ID[COG2850] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.123622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCGCGC TCGACCGGCC CGCCCTGGAC CCGCCTGCCC GCGAGCTGCC TGCCCGGGAG 
CTGCCTGCCC TCGAGTTGTT GAGCGGTGAC GCCCAGACCT TCCTGGCGAA GGTCTGGGCG
TCGCGCGTGC ACCTGCACCG CAGCGGCCCC GCCGACCCCG ACAGCCCCGG CAGCGCCGAC
GGCCCGGACA GCCTGGTCGG GCTGTTCGCG CTCGCCGACG CCGACCACCT GCTGACCTCG
AGCGCCGTCC GGACGCCGTC GATCCGGCTG GCCAAGGACG GCGCGGTGCT CCCGGAGTCG
GCGTACACCC GACGGGCGAG CCTCGCCGGC AAGCCGCTGA CCGGGCTGGT CGACGCCCGC
AAGGCGCTGG CGCTCTTCGA CGACGGCGCG ACCGTCGTCT TCCAGGGCCT GCACCGCTAC
TGGCCCCCGC TGACCCGGCT GATCGCCCGG CTCGAGCTCG AGCTGGGCCA CCCGTGCCAG
GCCAACGCGT ACCTCACCCC GCCGGGCGCG CAGGGCTTCG CGGTGCACTC GGACTCCCAC
GACGTGTTCG TGTTCCAGAC CGCCGGCTCG AAGCGCTGGG AGGTGCACGG GCCGGACGGC
CCCGAGGAGG TGCTGCTCGA GCCCGGGGTG TCGATGTACC TGCCGACCGG CACGCCGCAC
GCGGCCCGTG CCCAGGACAC CGTCTCCTTG CACGTCACGC TCGGCATCAA CCAGCTCACC
TGGCGCGGCC TGGTCGAGCG GACCGTGGCC GGGGCCCTCG GCGAGGTGGC CGACGAGCAC
CTGCCGGCCG GCTACCTCGA CGACCCGGCC GCGCTCGCCG GCCCGCTCGC GGACCGGCTC
GAGCGACTCG CGGACGCCGT CCGCCGCCTG GACGCGACCG CCGCCGTCGA GGCCGAGGTG
CGGCGGTTCC TCACCTCGCG GCCGCCGCGC CTGGACGGCG GGCTGCACGA CGTGCTCGCC
CACGGCACGA TCACCGACAC CACCCTGCTG CGCCGCCGGC CCGGCCACCC CTGCGTGCTC
CTCGACCGGG GTGAGCGGGT CGAGGTGCTG CTCGGCGACC GGTCGCTGAC CGTGCCCGCG
TGGATCCGCC CGGCACTCGA GGCGGTCCGC GCTCGCGGCG AGCTGACGCC GGCCGACCTG
CCGCTCGACG AGCAGAGCCG CCTGGTGCTG TGCCGACGAC TGGTCCGGGA GGGCCTCCTG
GAGGTCCGGT GA
 
Protein sequence
MSALDRPALD PPARELPARE LPALELLSGD AQTFLAKVWA SRVHLHRSGP ADPDSPGSAD 
GPDSLVGLFA LADADHLLTS SAVRTPSIRL AKDGAVLPES AYTRRASLAG KPLTGLVDAR
KALALFDDGA TVVFQGLHRY WPPLTRLIAR LELELGHPCQ ANAYLTPPGA QGFAVHSDSH
DVFVFQTAGS KRWEVHGPDG PEEVLLEPGV SMYLPTGTPH AARAQDTVSL HVTLGINQLT
WRGLVERTVA GALGEVADEH LPAGYLDDPA ALAGPLADRL ERLADAVRRL DATAAVEAEV
RRFLTSRPPR LDGGLHDVLA HGTITDTTLL RRRPGHPCVL LDRGERVEVL LGDRSLTVPA
WIRPALEAVR ARGELTPADL PLDEQSRLVL CRRLVREGLL EVR