Gene Noca_4371 details

Gene Information

Locus tag: Noca_4371
Symbol:
ID: 4596889
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4620780
End bp: 4621931
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 70%
IMG OID: 639778981
Product: hypothetical protein
Protein accession: YP_925555
Protein GI: 119718590
COG category:
COG ID:
TIGRFAM ID:
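
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only and are not fields of the database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4620780
end_bp = 4621931

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last
# codon is a stop codon, which does not encode an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1152
print(protein_length)  # 383
```

Under these assumptions the results agree with the stated gene length of 1152 bp and protein length of 383 aa.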


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCGACC TGACCCGTCC TCTCCAGGAC GCGATCGCCG AGGCGGAGAA GCTGATCGAG 
AGCGCGCCGT TCATCCGCAC CGAGCAGGAC CTGCTGGAGG GCTACGACTA CCTCTCCGGG
CGGATCCGGA TGGCGCTGCA GATGGCCTTC GACCACGACC TCGCGCGGCC GCTGTTCATC
AACCCCACCC ACCAGTTCTC CCGCCAGGGC CTGGACAACC CCGATGCCAT CTACTTCAAC
GCCTACCTCG AGGAGGGCGT CGAGTACGTC GTGCGCGGCG TGCGCGGCAG CACCGCCGAC
CTGTCCTTCC AGGTGATGGG CGGGGCCTAC ACCGCCGACT CGGCGGCCAC GTCGATGCTC
GCGTTCGACG ACCGCGAGCT CGACCTCGCC GAGGACGGCT CGTTCGAGTT CAGCTACGTC
GCCGAGCCGG GCGCGAAGAC GATGATCGTG CGCGAGGTCT TCAACGACTG GGACACCGAG
GAGCGCGGCC GGATCTGGAT CGAGCGCACC GACACCCTCG GGCTCCCGGC CGCGCCGCTC
ACCCGGGCGC GGCTGGAGCG GAAGTACGAG GTCGCCGCCA AGCTGCTGAC CGGGTCCATC
CGGACCTGGC TGGCGTTCCC CCAGTTCTTC GAGCGCCAGG AGCCCGCCAA CCAGCCGACC
CCGCCGAGGT CGACGCCCGG CGGTCTGTCG TCGCAGCGCT CGTCGATCGG CCACTACGAG
CTCGACGACG ACCAGGCGCT GATCATCACC GTCCCCGAGT GCACCGACTG CGCCTACCAG
GCGATCCAGA TCGGCTCGGA CTGGTACGTC TCCACCGACT ACGAGACCCA CCAGACCTCG
CTGACCAAGG CCCAGGCCGT GGTGGATCCC GACGGCCTGA TGCGGTTCGT CATCTCCGAG
CGCTCCCCCG CCGGTCCCGA CGCGCGGCTC GCCAACTGGC TCGAGTGCAC CGGCCACCGG
ACCGGGTCGC TGATGCTGCG CTGGCAGCGC CTCGAGCGCG ACCTCGGCCC CGCGGACGGC
CCCGTCGCCG AGGTCGTCGC GCTCGCCGAC GTACCGGACA GGCTGCCCCA CTTCACCCCG
ATCACCACCG AGCAGTACGC CGAGCGGATC GCCGCCCGGC AGCGCTCCGT CGCCCGAAGG
ATGCTGAGCT GA
 
Protein sequence
MGDLTRPLQD AIAEAEKLIE SAPFIRTEQD LLEGYDYLSG RIRMALQMAF DHDLARPLFI 
NPTHQFSRQG LDNPDAIYFN AYLEEGVEYV VRGVRGSTAD LSFQVMGGAY TADSAATSML
AFDDRELDLA EDGSFEFSYV AEPGAKTMIV REVFNDWDTE ERGRIWIERT DTLGLPAAPL
TRARLERKYE VAAKLLTGSI RTWLAFPQFF ERQEPANQPT PPRSTPGGLS SQRSSIGHYE
LDDDQALIIT VPECTDCAYQ AIQIGSDWYV STDYETHQTS LTKAQAVVDP DGLMRFVISE
RSPAGPDARL ANWLECTGHR TGSLMLRWQR LERDLGPADG PVAEVVALAD VPDRLPHFTP
ITTEQYAERI AARQRSVARR MLS
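
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the tool used to generate this record: the helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the codon table is built from the standard genetic code, whose codon-to-amino-acid assignments are shared by translation table 11 (the two tables differ only in the permitted start codons). Only the first 60 and last 12 bases of the gene sequence are included here.

```python
from itertools import product

# Standard genetic code in TCAG order; translation table 11 uses the
# same codon-to-amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()

def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, starting at position 1."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)

# First 60 bases and last 12 bases of the gene sequence in this record.
first_60 = "ATGGGCGACC TGACCCGTCC TCTCCAGGAC GCGATCGCCG AGGCGGAGAA GCTGATCGAG"
last_12 = "ATGCTGAGCT GA"

print(translate(first_60))  # MGDLTRPLQDAIAEAEKLIE
print(translate(last_12))   # MLS*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final three residues followed by the stop codon. Calling `gc_fraction` on the full gene sequence would give a value to compare against the listed GC content.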