Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_3810 |
Symbol | |
ID | 4599033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008699 |
Strand | + |
Start bp | 4028263 |
End bp | 4029480 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639778418 |
Product | exonuclease, RNase T and DNA polymerase III |
Protein accession | YP_924997 |
Protein GI | 119718032 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0847] DNA polymerase III, epsilon subunit and related 3'-5' exonucleases |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.730242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGGCT ACACCATCAT TGACCTCGAG ACCACCGGGC TCTTCCCGCA GAAGCATGAC CGAATCGTTG AGATCGGCCT CATTGAGGTT TCAGACTCAG GAGCGATCGA GCGCGAGTGG GCCACCCTCG TCAACCCGCA GCGCGACGTG GGCCCGACGC ACATCCATGG CATTACGGCG CGCGACGTGC TCGATGCCCC GACCTTCCAG GAAGTTGCTC CATACCTGAT CGGTTCGCTG GCCGGTCGTA CTCTGGTTGC GCACAACGCA CGCTTCGACA CCCAGTTCCT CGACTACGAG TTTGAGCGCG CCAGCGTCGG AACGCGACCG CCTACGCCGT CACTGTGCAC CATGCAGCTG TCGAGTTCGT ATCTACGAGG CGCGTCTCGA AAGCTGAAGG ACTGCTGCGT TGCCGCAAAC GTTCCGCACG CGGACGAGCA CACGGCCCTC GGTGACGCCC GAGCCGTCGC TGGTCTTCTC AACTATTACC TCGTCAATAC CGACAGACCG GTGCCGTGGT CCGCTGTCTT GGAATCCACA CGGCGTCACT GGTGGCCGGC CCCCGGGCCA ACGCCGGGCA GGCCCCGCTC AGTACTTCGG TCTGCCACTC CTCGAGAACC GGCCGCCTGG CTGGACCGCA TCACCTCGAC CCTGCCGCGT AACCCCAATC CCATGGTCGA GGCGTACCTC GACGTTCTGG AGCAGGCACT GCTCGACGGA TACCTGTCCG CACACGAGGA GAACGCCCTC GTTGACCTCG CGCTATCTCT TGGCTTGCAC CGCGACCATT TGGCCGCGGT CCATGCCACG TACCTCGATT CAATGGCGAT CGCGGCCTGG GCCGACGGCA TGGTGACCGA GACCGAGCTG GCGGATCTCA CTAGTGTTGC GACGGCGCTC GGCTTACCGA CCGACTTGGT GAGGGTGGCC ATCAAGAGGG CCAAGAACGT CGCGGCGCAC GCGACGAGCG AGGGCGGTTT CAAGTTGTGT GTTGGTGATC AGGTTGTGTT CACCGGCGAG CTGTCTGTCC CACGCGACCA GTTAATCGAC TTGGCACAAC AAGCTGGCTT GAAGCATGGC GGAGTCAATA AGAGCACCAA GCTCGTCGTC GCCGCAGATC CCGATTCCCA GAGTGGCAAA GCCGCGAAGG CTCGCAGCTA CGGAATACCC GTCGTCACGG AGGCAGCCTT CGCTCGTATG CTCGCCGACC TCCACTAG
|
Protein sequence | MTGYTIIDLE TTGLFPQKHD RIVEIGLIEV SDSGAIEREW ATLVNPQRDV GPTHIHGITA RDVLDAPTFQ EVAPYLIGSL AGRTLVAHNA RFDTQFLDYE FERASVGTRP PTPSLCTMQL SSSYLRGASR KLKDCCVAAN VPHADEHTAL GDARAVAGLL NYYLVNTDRP VPWSAVLEST RRHWWPAPGP TPGRPRSVLR SATPREPAAW LDRITSTLPR NPNPMVEAYL DVLEQALLDG YLSAHEENAL VDLALSLGLH RDHLAAVHAT YLDSMAIAAW ADGMVTETEL ADLTSVATAL GLPTDLVRVA IKRAKNVAAH ATSEGGFKLC VGDQVVFTGE LSVPRDQLID LAQQAGLKHG GVNKSTKLVV AADPDSQSGK AAKARSYGIP VVTEAAFARM LADLH
|
| |