Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_1039 |
Symbol | |
ID | 4599700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | - |
Start bp | 1095685 |
End bp | 1097034 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 639775638 |
Product | hypothetical protein |
Protein accession | YP_922245 |
Protein GI | 119715280 |
COG category | [R] General function prediction only |
COG ID | [COG2232] Predicted ATP-dependent carboligase related to biotin carboxylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.71771 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGAAC GGGCGGCGGT CCTCAAGAAG GTGACCGAGG AGCTCGGGGA CCGCACACTC GTGTGGTCAG GGATCCGTGG GGACGACATC GAGCCACTAC GCGACGTACC GCAACTTGGG TACTCGTTCT CGATCGCCGG GGCGTACAAC GGTCGCTCTG ACGTTCGTGG TCTGGCCTAC GAACAGCTCA CTCGAGTACG TGTCGATCCG GAGGTTTGGG ACATCGACTT CCACCACCAT GAAAGAGCTA CGGAGGAGTT CCGGCGGGGA CTTCTCCGAT CCATGAGCGA GCCAAGCGCA CTCCTTCCGT ACCGTCCGTC AAGCTTCCTC TCGGCGATCC ACTTCGCTCG CCAGGAACGC TGCGTGAACC TCGGCCTGTT TGGCGCGCAC CAATCCGCCT TCGAGCACAA GCCCTTCGTC GAGTCAGAAG TCGCCCAACT GGGAATCCCG CACATCCCGT GGACCTATAT CGCCGACGAG GAACAGCCCC TCGCGCGCAA CCTCGTTTCC GCCGGTCCTC TTGTCCTGCG CCGCAGTCGC ACGTCGGGTG GTGAAGGCAT CGTGCGCGTC GACTCAGCCG CTGAGATTGG ACCACACTGG CCGAAGGGCA CCGAGGAGTT CGTGAGCGTG GCGCCCTTCA TAGCCGACAC AATTCCCGTG AATGTCGGTG CCACTGTTTG GCGGGACTCT CGAGGCGACG ATTGCGTGAC GGTGCACCAC CCGTCGGTCC AGCTCATCGG CATCAAGTCA TGTGTGACCC GCGAGTTCGG CTATTGCGGC AATGACTTCG GTGCGGCGCG CGACCTCGAC CGGTCCACTA TCGATCGGAT CGAGCAGTCC ACCAAGCGCG TCGGGCGCTG GTTGCGCGGG AACGGTTACA TCGGCACGTT TGGCGTCGAC TACCTGATCA AGGATGGAGT GCCCCTGTTC ACCGAGATCA ACGCGAGATT TCAAGGCTCT ACATACTCGT CGTGTCGACT CTCGATCGAG CAGGGGGAAG CCTGTCTGAT GCTTGAGCAC GTCGCCGCCT GGTTGGGCCT ACCCATGCCG GAATCACCGA GCTTGTACGA GCGCACACGA GCAGTGCGGG ATCTCGCGAA CCTAGCGGTG CACTGGACCG GCCGAGAGGC CGCAACGGTC AACGCAACCG CTCTCTTCAG TGTGGTGATG GATGACTTCG ATCCCAAAGC GACCAGCGAC AGCGTTGTGG CAACGGACGT CGCCAACGAG CCAGGCTCCC TGGTCGGCCG TTTCAACGTT GGAAGGCGGG TGACCGATAC CGGCTACGAT CTTGCCCCTG AACTAGACCG ACTGATTGAC CGCTGGCGGC GCTCGGAGGA GGCATCTTGA
|
Protein sequence | MTERAAVLKK VTEELGDRTL VWSGIRGDDI EPLRDVPQLG YSFSIAGAYN GRSDVRGLAY EQLTRVRVDP EVWDIDFHHH ERATEEFRRG LLRSMSEPSA LLPYRPSSFL SAIHFARQER CVNLGLFGAH QSAFEHKPFV ESEVAQLGIP HIPWTYIADE EQPLARNLVS AGPLVLRRSR TSGGEGIVRV DSAAEIGPHW PKGTEEFVSV APFIADTIPV NVGATVWRDS RGDDCVTVHH PSVQLIGIKS CVTREFGYCG NDFGAARDLD RSTIDRIEQS TKRVGRWLRG NGYIGTFGVD YLIKDGVPLF TEINARFQGS TYSSCRLSIE QGEACLMLEH VAAWLGLPMP ESPSLYERTR AVRDLANLAV HWTGREAATV NATALFSVVM DDFDPKATSD SVVATDVANE PGSLVGRFNV GRRVTDTGYD LAPELDRLID RWRRSEEAS
|
| |