Gene Noca_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1039 
Symbol 
ID4599700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1095685 
End bp1097034 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content62% 
IMG OID639775638 
Producthypothetical protein 
Protein accessionYP_922245 
Protein GI119715280 
COG category[R] General function prediction only 
COG ID[COG2232] Predicted ATP-dependent carboligase related to biotin carboxylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.71771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGAAC GGGCGGCGGT CCTCAAGAAG GTGACCGAGG AGCTCGGGGA CCGCACACTC 
GTGTGGTCAG GGATCCGTGG GGACGACATC GAGCCACTAC GCGACGTACC GCAACTTGGG
TACTCGTTCT CGATCGCCGG GGCGTACAAC GGTCGCTCTG ACGTTCGTGG TCTGGCCTAC
GAACAGCTCA CTCGAGTACG TGTCGATCCG GAGGTTTGGG ACATCGACTT CCACCACCAT
GAAAGAGCTA CGGAGGAGTT CCGGCGGGGA CTTCTCCGAT CCATGAGCGA GCCAAGCGCA
CTCCTTCCGT ACCGTCCGTC AAGCTTCCTC TCGGCGATCC ACTTCGCTCG CCAGGAACGC
TGCGTGAACC TCGGCCTGTT TGGCGCGCAC CAATCCGCCT TCGAGCACAA GCCCTTCGTC
GAGTCAGAAG TCGCCCAACT GGGAATCCCG CACATCCCGT GGACCTATAT CGCCGACGAG
GAACAGCCCC TCGCGCGCAA CCTCGTTTCC GCCGGTCCTC TTGTCCTGCG CCGCAGTCGC
ACGTCGGGTG GTGAAGGCAT CGTGCGCGTC GACTCAGCCG CTGAGATTGG ACCACACTGG
CCGAAGGGCA CCGAGGAGTT CGTGAGCGTG GCGCCCTTCA TAGCCGACAC AATTCCCGTG
AATGTCGGTG CCACTGTTTG GCGGGACTCT CGAGGCGACG ATTGCGTGAC GGTGCACCAC
CCGTCGGTCC AGCTCATCGG CATCAAGTCA TGTGTGACCC GCGAGTTCGG CTATTGCGGC
AATGACTTCG GTGCGGCGCG CGACCTCGAC CGGTCCACTA TCGATCGGAT CGAGCAGTCC
ACCAAGCGCG TCGGGCGCTG GTTGCGCGGG AACGGTTACA TCGGCACGTT TGGCGTCGAC
TACCTGATCA AGGATGGAGT GCCCCTGTTC ACCGAGATCA ACGCGAGATT TCAAGGCTCT
ACATACTCGT CGTGTCGACT CTCGATCGAG CAGGGGGAAG CCTGTCTGAT GCTTGAGCAC
GTCGCCGCCT GGTTGGGCCT ACCCATGCCG GAATCACCGA GCTTGTACGA GCGCACACGA
GCAGTGCGGG ATCTCGCGAA CCTAGCGGTG CACTGGACCG GCCGAGAGGC CGCAACGGTC
AACGCAACCG CTCTCTTCAG TGTGGTGATG GATGACTTCG ATCCCAAAGC GACCAGCGAC
AGCGTTGTGG CAACGGACGT CGCCAACGAG CCAGGCTCCC TGGTCGGCCG TTTCAACGTT
GGAAGGCGGG TGACCGATAC CGGCTACGAT CTTGCCCCTG AACTAGACCG ACTGATTGAC
CGCTGGCGGC GCTCGGAGGA GGCATCTTGA
 
Protein sequence
MTERAAVLKK VTEELGDRTL VWSGIRGDDI EPLRDVPQLG YSFSIAGAYN GRSDVRGLAY 
EQLTRVRVDP EVWDIDFHHH ERATEEFRRG LLRSMSEPSA LLPYRPSSFL SAIHFARQER
CVNLGLFGAH QSAFEHKPFV ESEVAQLGIP HIPWTYIADE EQPLARNLVS AGPLVLRRSR
TSGGEGIVRV DSAAEIGPHW PKGTEEFVSV APFIADTIPV NVGATVWRDS RGDDCVTVHH
PSVQLIGIKS CVTREFGYCG NDFGAARDLD RSTIDRIEQS TKRVGRWLRG NGYIGTFGVD
YLIKDGVPLF TEINARFQGS TYSSCRLSIE QGEACLMLEH VAAWLGLPMP ESPSLYERTR
AVRDLANLAV HWTGREAATV NATALFSVVM DDFDPKATSD SVVATDVANE PGSLVGRFNV
GRRVTDTGYD LAPELDRLID RWRRSEEAS