Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noca_4770 |
Symbol | |
ID | 4595369 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardioides sp. JS614 |
Kingdom | Bacteria |
Replicon accession | NC_008697 |
Strand | - |
Start bp | 73249 |
End bp | 77244 |
Gene Length | 3996 bp |
Protein Length | 1331 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639772559 |
Product | putative type II DNA modification enzyme |
Protein accession | YP_919219 |
Protein GI | 119714077 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.773591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.43682 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACGGCG GACTGACCTC GGTCCGCGTG GCCGGCGCGC TGATGCCCGG CGACGTGCTC TCAGCTGTCC TGGCCGGCGA CCTGGACGGC CTCACGGGCT CGGCGTACCA CCTGGGGTCC GAGAGTCCCC GGGAGGCGGC CGCACGGGTC TGGACACACC TGCTCGGGGT CTACCGGCGG TTCCGCTCCG ACCTCGACAG CCTGCCCGAC GAGGACCCCG CGGTCGGGCT CACACGCGAA CGTTGGCTCA CGCTCCTGCT CTCCGAGCTC GGCTACGGCC GGGTGCCGCC CACGCCGGCG GGTGGCTTGG CCGTGGGGGA CAAGCAGTAC CCGGTCAGCC ACCTGTGGGG CGCCACCCCG ATGCACCTGC TCGGATGGGG AGTCCCGCTC GACAAGCGGT CCGCCGGTGT AGCCGGGGCG GCACGCGCTC CGCACGCGAT GGTGCAAGAA CTGCTCAACC GCACCGATGA GTACCTCTGG GCGGTCGTGG CCAACGGCCG CGTGCTACGC CTGCTGCGCG ACTCCACCAC CCTCACCGGG CAGGCCTACG TCGAGTTCGA CCTTGAGGCG ATGTTCGACG GCGAGCTCTT CGCCGAGTTC GCCCTCCTCT ACCTGCTCTG CCACCAGTCC CGCGTCGAAG TGCCGGATGA CGGCCAGCCC GCCGACTGCT GGCTCGAGCG CTGGCGCGTC ACCGCTGTGA GCCAGGGCGT GCGTGCCATG ACGCTGCTGC GTGACGGGGT CGAACTGGCG CTGGAGACCC TCGGCACGGG CTTCCTCCAG CACCCTGCGA ATACCAAGCT GCGCGACCGC CTGGAGCGCG GCGAGATCCG CCTCAGCGAC GTCCACGCCG CGCTGCTGCG ACTGGCCTAC CGACTCCTCT TCTGGGCCGT CGCCGAGGAC CGTGACGCCC TGCTCAGCCC AGACGCCAGC CAGGAGCAGC GAAGGCGCTA CGCCGAGCAC TTCTCCTCCA CGCGCCTTCG TCGTCTGGCC GTCCGCCGTC ACGGCAGCGG CCACGACGAT CTGTGGCAGG CCGCCACGTT CGTCCTAGAC GCCCTCGGCC GCAAGGAGGG TGAGCAACGC CTCGGGCTCC CTGGCCTCGG TGGCCTCTTC GCACCCACCG CAGCCGATGT CCTAGCCGGC AGCCGACTTC CGAACGCCGC CCTTCTCACC GCCGTTCGCT CGCTTGCGGT CGTGCAACCG AAGGGCCAGC CGCAGCGACT GGTCGACTTC GCCCACCTCG GCGCCGAAGA ACTCGGCTCG ATCTACGAGT CCCTGCTGGA GCTGGTGCCA CGCCACGACC CGACCACCCA CGCCTTCACT CTGGAGACCC TCGCCGGCAA CGACCGCAAG ACCTCCGGCA GTTACTACAC CCCCACCGAA CTCGTCGAGC TCGTCCTCGA CACCGCACTC GACCCCGTGC TGGACGACGC CGAGAAGAAC GCCCACACCA CCGAGGAGGC CGAAGCAGCC CTACTGGGGC TGCGGGTGTG CGACCCGAGC GTCGGATCGG CGCACTTCCT GGTAGCTGCT GCCCGCCGCA TCGCCACTCG ACTCGCCACC GTGCGCACCG GCGAAGTCGA CCCCACCCCA ACCGCCTACA GCGACGCAAT GCACGACGTC GTCGCCCGCT GCGTCTACGG CATCGACATC AACCCGATGG CGGCCGACCT GGCCAAGGTC AGCCTCTGGC TCACCGCCAT GAGCCCCGGC CGCCCCCTAT CGTTCCTCGA CCACCACATC AAGGTCGGCA ACGCCCTCCT CGGCACCACT CCCGCCCTCA TCCATGGGGG CATTCCCGAC ACCGCCTACG TCGCCCTCAC CGGCGACGAC AAGTCCGCCG CGTCCACATT GAAGAAGCGC AACGCCACCG AACGCGGTCA GGGCGACCTC TTCGACGACG CCGGCATCGA TATCGACACC GCAAGCCTGC GCAAGGCCAC CGCCGAGATC ACCGACCGCG CCGCCGCAGC CACCACCGTG GACGACGTCG CCTGGGCCGC CCAGCGCTAC GCCGACCTAC AGAGCGATCC CGACATCATC CGCGCCCGCC GCGTCGCCGA CGCCTGGTGC GCGGCCTTCC TCGGCCCCAA GACCGCCGAC GCCGAGCCGA TCACCCACCG GGCCCTGACC GCCATCGCCG ACGAAGCCGC TCCCGACCCC GTTGTGAAAG CCGTCGACGA GCTCGCGACC CGGCACCGGC TGTTCCACTG GCATCTGGAG TTCCCCGACG TGTTCCGCGT GCCCGACGAC GGTCTCGCGC GCGGTCCGTA CGGCTGGACA GGTGGCTTCG ATGCGGTCCT CGGCAATCCA CCGTGGGAGC GCATCAAGCT CCAGGAGCAG GAGTTCTTCG CGATCCGCGA GCCGGCCATT GCCGAGGCGA AGAACGCCGC CGCCCGCAAA AAGGCCATCG CGGCACTCGC CGAGACCGAC CCAGACCTGT TTGGCGAGTT CAACGCCGCG CGCCGACAGA GCGAAGCCGA GAGCCAGTTC CTCCGCGGCA GCGGCCGTTA CCCGCTGTGC GGCGTCGGCG ACGTAAATAC CTACAGCGTC TTCGCCGAGC ACTTCCGCGC CACCCTCGCT CCGACCGGCC GCAGCGGCAT CATCACCCCG ACCGGCCTTG CGACGGACGC CACCACCGCC GCCTTTTTCG CCGACACGAT CACGTCTGGG CGCCTCGCGG CCTTCTTCGA TTTCGTCACC GGACCGGAGA TCTGGAGCGG GATAGGCCAC AACAGGTTCC GCTTCGCCGT GTCCTCGACC ACTGGTGGTG AGCGCATCCC CGAGGCGCAG CTCTCCTTCG ACAACCGGCA TCCGCGCGAC CTTCAGATCG CAGATCGCAA ATACAGCCTG CCATCCGATG ACCTGGTCCT GTTGAACCCG AACACCGGAA CGTTGCCCAT TTTCGCGGAC ACCCGCGACG CAGAGGTGAC CCTCGCCTGC TATCGCCGAC ACCCCATCCT GATCCGCGAC GGCGGACGCA ATCCCTGGGG GCTTCGCTTT TCCCGTCTCT TCGACATGGC AAACGACAGC GCTCTTTTCC ACACCGTCGA GGACCTTGAA GACCTCGAGG CCACCTTCGA CGGCTGGGCC TGGACCCACG CCGACCAGCG CTGGCTGCCT CTGTACGAAG CGAAGATGCT GAGCCACTGG AACTCCCGCT TCTCGGGATA TGCCGACGTC CCCGAGGGCT ACCAGGGGAC TGCGTTGCCG CGACTCACCG ACGAGCGGTT GGACGACCCC GCCTCCGAAC CGATGGCCCG CTACTGGGTG CCTGAGGCCA ACGTCACCAA GGCGATCCCC GAGGGATGGG ATCGAAACTG GTTGTTCGGG TGGCGCGACA TCGCCCGATC CAGCGACATG CGCACGTTCG TCCCGAGTGT GCTCCCACGC GCCGCGGTGG GGGACAAGTT CCTGCTCGCC TTCGCAGCGG CTCCGAGCAA GACGCCGTTC TTGCAGGCGG TTTGGTCCTC GCTGATCTTC GACTACATCT CGAGGCAGAA GATCAGCGGC ACAGGAATGA AGTACTTCTT GACCAAGCAA TTGGCGTGCC CAGAGCCGGA GGCATTCGAT GGCGTACCAG CTTGGTCTCA AGAGCCCTTG GGGGCCTTTG TGCGGGCTCG AGTTCTGGAG CTGACCTACA CGAGTGAAAG GCTCGCAGCG TACGCCGTCG ACGTTCTTTC AGGCGAGCCC GGCACAACGG ATCCCGGGCC GCCGTTCCGA TGGGTTCCTG AGCGCCGGGA GCAGCTGCGC GCCGAGCTTG AGGCCGCCAT GCTTTGCCTC TACGGCCTCG ATCGCGAGGA TGCGGAATAC GTCCTCGATT CGTTCGTCTT GGTATGCAAG TACGAGGAGC GCGACCACGG GGAGTTCCGG ACCAAGCGGC TCGTGCTTGC CGCCTACGAC GCCATGGCAG CCGCTGCCGA GAGCGGCGTG CCGTTCGTCA GCCCGCTGGA CCCGGCCCCC GGCGAAGGCC CTCGACACCT GGAGCGTGAG TCGTGA
|
Protein sequence | MNGGLTSVRV AGALMPGDVL SAVLAGDLDG LTGSAYHLGS ESPREAAARV WTHLLGVYRR FRSDLDSLPD EDPAVGLTRE RWLTLLLSEL GYGRVPPTPA GGLAVGDKQY PVSHLWGATP MHLLGWGVPL DKRSAGVAGA ARAPHAMVQE LLNRTDEYLW AVVANGRVLR LLRDSTTLTG QAYVEFDLEA MFDGELFAEF ALLYLLCHQS RVEVPDDGQP ADCWLERWRV TAVSQGVRAM TLLRDGVELA LETLGTGFLQ HPANTKLRDR LERGEIRLSD VHAALLRLAY RLLFWAVAED RDALLSPDAS QEQRRRYAEH FSSTRLRRLA VRRHGSGHDD LWQAATFVLD ALGRKEGEQR LGLPGLGGLF APTAADVLAG SRLPNAALLT AVRSLAVVQP KGQPQRLVDF AHLGAEELGS IYESLLELVP RHDPTTHAFT LETLAGNDRK TSGSYYTPTE LVELVLDTAL DPVLDDAEKN AHTTEEAEAA LLGLRVCDPS VGSAHFLVAA ARRIATRLAT VRTGEVDPTP TAYSDAMHDV VARCVYGIDI NPMAADLAKV SLWLTAMSPG RPLSFLDHHI KVGNALLGTT PALIHGGIPD TAYVALTGDD KSAASTLKKR NATERGQGDL FDDAGIDIDT ASLRKATAEI TDRAAAATTV DDVAWAAQRY ADLQSDPDII RARRVADAWC AAFLGPKTAD AEPITHRALT AIADEAAPDP VVKAVDELAT RHRLFHWHLE FPDVFRVPDD GLARGPYGWT GGFDAVLGNP PWERIKLQEQ EFFAIREPAI AEAKNAAARK KAIAALAETD PDLFGEFNAA RRQSEAESQF LRGSGRYPLC GVGDVNTYSV FAEHFRATLA PTGRSGIITP TGLATDATTA AFFADTITSG RLAAFFDFVT GPEIWSGIGH NRFRFAVSST TGGERIPEAQ LSFDNRHPRD LQIADRKYSL PSDDLVLLNP NTGTLPIFAD TRDAEVTLAC YRRHPILIRD GGRNPWGLRF SRLFDMANDS ALFHTVEDLE DLEATFDGWA WTHADQRWLP LYEAKMLSHW NSRFSGYADV PEGYQGTALP RLTDERLDDP ASEPMARYWV PEANVTKAIP EGWDRNWLFG WRDIARSSDM RTFVPSVLPR AAVGDKFLLA FAAAPSKTPF LQAVWSSLIF DYISRQKISG TGMKYFLTKQ LACPEPEAFD GVPAWSQEPL GAFVRARVLE LTYTSERLAA YAVDVLSGEP GTTDPGPPFR WVPERREQLR AELEAAMLCL YGLDREDAEY VLDSFVLVCK YEERDHGEFR TKRLVLAAYD AMAAAAESGV PFVSPLDPAP GEGPRHLERE S
|
| |