Gene Noca_4770 details


Gene Information

Locus tag: Noca_4770
Symbol:
ID: 4595369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008697
Strand:
Start bp: 73249
End bp: 77244
Gene Length: 3996 bp
Protein Length: 1331 aa
Translation table: 11
GC content: 68%
IMG OID: 639772559
Product: putative type II DNA modification enzyme
Protein accession: YP_919219
Protein GI: 119714077
COG category:
COG ID:
TIGRFAM ID:
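
The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp):
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(73249, 77244)
    print(length)                           # 3996
    print(expected_protein_length(length))  # 1331
```

Under those assumptions, both values agree with the Gene Length and Protein Length fields listed in the record.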


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.773591
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.43682
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGGCG GACTGACCTC GGTCCGCGTG GCCGGCGCGC TGATGCCCGG CGACGTGCTC 
TCAGCTGTCC TGGCCGGCGA CCTGGACGGC CTCACGGGCT CGGCGTACCA CCTGGGGTCC
GAGAGTCCCC GGGAGGCGGC CGCACGGGTC TGGACACACC TGCTCGGGGT CTACCGGCGG
TTCCGCTCCG ACCTCGACAG CCTGCCCGAC GAGGACCCCG CGGTCGGGCT CACACGCGAA
CGTTGGCTCA CGCTCCTGCT CTCCGAGCTC GGCTACGGCC GGGTGCCGCC CACGCCGGCG
GGTGGCTTGG CCGTGGGGGA CAAGCAGTAC CCGGTCAGCC ACCTGTGGGG CGCCACCCCG
ATGCACCTGC TCGGATGGGG AGTCCCGCTC GACAAGCGGT CCGCCGGTGT AGCCGGGGCG
GCACGCGCTC CGCACGCGAT GGTGCAAGAA CTGCTCAACC GCACCGATGA GTACCTCTGG
GCGGTCGTGG CCAACGGCCG CGTGCTACGC CTGCTGCGCG ACTCCACCAC CCTCACCGGG
CAGGCCTACG TCGAGTTCGA CCTTGAGGCG ATGTTCGACG GCGAGCTCTT CGCCGAGTTC
GCCCTCCTCT ACCTGCTCTG CCACCAGTCC CGCGTCGAAG TGCCGGATGA CGGCCAGCCC
GCCGACTGCT GGCTCGAGCG CTGGCGCGTC ACCGCTGTGA GCCAGGGCGT GCGTGCCATG
ACGCTGCTGC GTGACGGGGT CGAACTGGCG CTGGAGACCC TCGGCACGGG CTTCCTCCAG
CACCCTGCGA ATACCAAGCT GCGCGACCGC CTGGAGCGCG GCGAGATCCG CCTCAGCGAC
GTCCACGCCG CGCTGCTGCG ACTGGCCTAC CGACTCCTCT TCTGGGCCGT CGCCGAGGAC
CGTGACGCCC TGCTCAGCCC AGACGCCAGC CAGGAGCAGC GAAGGCGCTA CGCCGAGCAC
TTCTCCTCCA CGCGCCTTCG TCGTCTGGCC GTCCGCCGTC ACGGCAGCGG CCACGACGAT
CTGTGGCAGG CCGCCACGTT CGTCCTAGAC GCCCTCGGCC GCAAGGAGGG TGAGCAACGC
CTCGGGCTCC CTGGCCTCGG TGGCCTCTTC GCACCCACCG CAGCCGATGT CCTAGCCGGC
AGCCGACTTC CGAACGCCGC CCTTCTCACC GCCGTTCGCT CGCTTGCGGT CGTGCAACCG
AAGGGCCAGC CGCAGCGACT GGTCGACTTC GCCCACCTCG GCGCCGAAGA ACTCGGCTCG
ATCTACGAGT CCCTGCTGGA GCTGGTGCCA CGCCACGACC CGACCACCCA CGCCTTCACT
CTGGAGACCC TCGCCGGCAA CGACCGCAAG ACCTCCGGCA GTTACTACAC CCCCACCGAA
CTCGTCGAGC TCGTCCTCGA CACCGCACTC GACCCCGTGC TGGACGACGC CGAGAAGAAC
GCCCACACCA CCGAGGAGGC CGAAGCAGCC CTACTGGGGC TGCGGGTGTG CGACCCGAGC
GTCGGATCGG CGCACTTCCT GGTAGCTGCT GCCCGCCGCA TCGCCACTCG ACTCGCCACC
GTGCGCACCG GCGAAGTCGA CCCCACCCCA ACCGCCTACA GCGACGCAAT GCACGACGTC
GTCGCCCGCT GCGTCTACGG CATCGACATC AACCCGATGG CGGCCGACCT GGCCAAGGTC
AGCCTCTGGC TCACCGCCAT GAGCCCCGGC CGCCCCCTAT CGTTCCTCGA CCACCACATC
AAGGTCGGCA ACGCCCTCCT CGGCACCACT CCCGCCCTCA TCCATGGGGG CATTCCCGAC
ACCGCCTACG TCGCCCTCAC CGGCGACGAC AAGTCCGCCG CGTCCACATT GAAGAAGCGC
AACGCCACCG AACGCGGTCA GGGCGACCTC TTCGACGACG CCGGCATCGA TATCGACACC
GCAAGCCTGC GCAAGGCCAC CGCCGAGATC ACCGACCGCG CCGCCGCAGC CACCACCGTG
GACGACGTCG CCTGGGCCGC CCAGCGCTAC GCCGACCTAC AGAGCGATCC CGACATCATC
CGCGCCCGCC GCGTCGCCGA CGCCTGGTGC GCGGCCTTCC TCGGCCCCAA GACCGCCGAC
GCCGAGCCGA TCACCCACCG GGCCCTGACC GCCATCGCCG ACGAAGCCGC TCCCGACCCC
GTTGTGAAAG CCGTCGACGA GCTCGCGACC CGGCACCGGC TGTTCCACTG GCATCTGGAG
TTCCCCGACG TGTTCCGCGT GCCCGACGAC GGTCTCGCGC GCGGTCCGTA CGGCTGGACA
GGTGGCTTCG ATGCGGTCCT CGGCAATCCA CCGTGGGAGC GCATCAAGCT CCAGGAGCAG
GAGTTCTTCG CGATCCGCGA GCCGGCCATT GCCGAGGCGA AGAACGCCGC CGCCCGCAAA
AAGGCCATCG CGGCACTCGC CGAGACCGAC CCAGACCTGT TTGGCGAGTT CAACGCCGCG
CGCCGACAGA GCGAAGCCGA GAGCCAGTTC CTCCGCGGCA GCGGCCGTTA CCCGCTGTGC
GGCGTCGGCG ACGTAAATAC CTACAGCGTC TTCGCCGAGC ACTTCCGCGC CACCCTCGCT
CCGACCGGCC GCAGCGGCAT CATCACCCCG ACCGGCCTTG CGACGGACGC CACCACCGCC
GCCTTTTTCG CCGACACGAT CACGTCTGGG CGCCTCGCGG CCTTCTTCGA TTTCGTCACC
GGACCGGAGA TCTGGAGCGG GATAGGCCAC AACAGGTTCC GCTTCGCCGT GTCCTCGACC
ACTGGTGGTG AGCGCATCCC CGAGGCGCAG CTCTCCTTCG ACAACCGGCA TCCGCGCGAC
CTTCAGATCG CAGATCGCAA ATACAGCCTG CCATCCGATG ACCTGGTCCT GTTGAACCCG
AACACCGGAA CGTTGCCCAT TTTCGCGGAC ACCCGCGACG CAGAGGTGAC CCTCGCCTGC
TATCGCCGAC ACCCCATCCT GATCCGCGAC GGCGGACGCA ATCCCTGGGG GCTTCGCTTT
TCCCGTCTCT TCGACATGGC AAACGACAGC GCTCTTTTCC ACACCGTCGA GGACCTTGAA
GACCTCGAGG CCACCTTCGA CGGCTGGGCC TGGACCCACG CCGACCAGCG CTGGCTGCCT
CTGTACGAAG CGAAGATGCT GAGCCACTGG AACTCCCGCT TCTCGGGATA TGCCGACGTC
CCCGAGGGCT ACCAGGGGAC TGCGTTGCCG CGACTCACCG ACGAGCGGTT GGACGACCCC
GCCTCCGAAC CGATGGCCCG CTACTGGGTG CCTGAGGCCA ACGTCACCAA GGCGATCCCC
GAGGGATGGG ATCGAAACTG GTTGTTCGGG TGGCGCGACA TCGCCCGATC CAGCGACATG
CGCACGTTCG TCCCGAGTGT GCTCCCACGC GCCGCGGTGG GGGACAAGTT CCTGCTCGCC
TTCGCAGCGG CTCCGAGCAA GACGCCGTTC TTGCAGGCGG TTTGGTCCTC GCTGATCTTC
GACTACATCT CGAGGCAGAA GATCAGCGGC ACAGGAATGA AGTACTTCTT GACCAAGCAA
TTGGCGTGCC CAGAGCCGGA GGCATTCGAT GGCGTACCAG CTTGGTCTCA AGAGCCCTTG
GGGGCCTTTG TGCGGGCTCG AGTTCTGGAG CTGACCTACA CGAGTGAAAG GCTCGCAGCG
TACGCCGTCG ACGTTCTTTC AGGCGAGCCC GGCACAACGG ATCCCGGGCC GCCGTTCCGA
TGGGTTCCTG AGCGCCGGGA GCAGCTGCGC GCCGAGCTTG AGGCCGCCAT GCTTTGCCTC
TACGGCCTCG ATCGCGAGGA TGCGGAATAC GTCCTCGATT CGTTCGTCTT GGTATGCAAG
TACGAGGAGC GCGACCACGG GGAGTTCCGG ACCAAGCGGC TCGTGCTTGC CGCCTACGAC
GCCATGGCAG CCGCTGCCGA GAGCGGCGTG CCGTTCGTCA GCCCGCTGGA CCCGGCCCCC
GGCGAAGGCC CTCGACACCT GGAGCGTGAG TCGTGA
 
Protein sequence
MNGGLTSVRV AGALMPGDVL SAVLAGDLDG LTGSAYHLGS ESPREAAARV WTHLLGVYRR 
FRSDLDSLPD EDPAVGLTRE RWLTLLLSEL GYGRVPPTPA GGLAVGDKQY PVSHLWGATP
MHLLGWGVPL DKRSAGVAGA ARAPHAMVQE LLNRTDEYLW AVVANGRVLR LLRDSTTLTG
QAYVEFDLEA MFDGELFAEF ALLYLLCHQS RVEVPDDGQP ADCWLERWRV TAVSQGVRAM
TLLRDGVELA LETLGTGFLQ HPANTKLRDR LERGEIRLSD VHAALLRLAY RLLFWAVAED
RDALLSPDAS QEQRRRYAEH FSSTRLRRLA VRRHGSGHDD LWQAATFVLD ALGRKEGEQR
LGLPGLGGLF APTAADVLAG SRLPNAALLT AVRSLAVVQP KGQPQRLVDF AHLGAEELGS
IYESLLELVP RHDPTTHAFT LETLAGNDRK TSGSYYTPTE LVELVLDTAL DPVLDDAEKN
AHTTEEAEAA LLGLRVCDPS VGSAHFLVAA ARRIATRLAT VRTGEVDPTP TAYSDAMHDV
VARCVYGIDI NPMAADLAKV SLWLTAMSPG RPLSFLDHHI KVGNALLGTT PALIHGGIPD
TAYVALTGDD KSAASTLKKR NATERGQGDL FDDAGIDIDT ASLRKATAEI TDRAAAATTV
DDVAWAAQRY ADLQSDPDII RARRVADAWC AAFLGPKTAD AEPITHRALT AIADEAAPDP
VVKAVDELAT RHRLFHWHLE FPDVFRVPDD GLARGPYGWT GGFDAVLGNP PWERIKLQEQ
EFFAIREPAI AEAKNAAARK KAIAALAETD PDLFGEFNAA RRQSEAESQF LRGSGRYPLC
GVGDVNTYSV FAEHFRATLA PTGRSGIITP TGLATDATTA AFFADTITSG RLAAFFDFVT
GPEIWSGIGH NRFRFAVSST TGGERIPEAQ LSFDNRHPRD LQIADRKYSL PSDDLVLLNP
NTGTLPIFAD TRDAEVTLAC YRRHPILIRD GGRNPWGLRF SRLFDMANDS ALFHTVEDLE
DLEATFDGWA WTHADQRWLP LYEAKMLSHW NSRFSGYADV PEGYQGTALP RLTDERLDDP
ASEPMARYWV PEANVTKAIP EGWDRNWLFG WRDIARSSDM RTFVPSVLPR AAVGDKFLLA
FAAAPSKTPF LQAVWSSLIF DYISRQKISG TGMKYFLTKQ LACPEPEAFD GVPAWSQEPL
GAFVRARVLE LTYTSERLAA YAVDVLSGEP GTTDPGPPFR WVPERREQLR AELEAAMLCL
YGLDREDAEY VLDSFVLVCK YEERDHGEFR TKRLVLAAYD AMAAAAESGV PFVSPLDPAP
GEGPRHLERE S
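
The sequences above are printed in space-separated groups of ten characters, so they need to be cleaned before any calculation. The following is a minimal sketch of how the gene sequence could be normalised and sanity-checked against the record (GC content, length divisible by three, start and stop codons). The helper names are hypothetical. The stop codon set is the one used by translation table 11; the start codon set is only a commonly used subset of the initiation codons that table allows.

```python
import re

# Stop codons in translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: a subset of the initiation codons permitted by table 11.
COMMON_STARTS = {"ATG", "GTG", "TTG"}


def clean_sequence(text):
    """Remove all whitespace (group spaces, line breaks) and upper-case the result."""
    return re.sub(r"\s+", "", text).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def check_cds(seq):
    """Basic structural checks for a coding sequence."""
    return {
        "length_multiple_of_3": len(seq) % 3 == 0,
        "starts_with_common_start": seq[:3] in COMMON_STARTS,
        "ends_with_stop": seq[-3:] in STOP_CODONS,
    }


if __name__ == "__main__":
    # First printed line of the gene sequence, as it appears in the record.
    first_line = (
        "GTGAACGGCG GACTGACCTC GGTCCGCGTG "
        "GCCGGCGCGC TGATGCCCGG CGACGTGCTC"
    )
    seq = clean_sequence(first_line)
    print(len(seq))  # 60
    print(round(gc_fraction(seq), 3))
```

Pasting the whole gene sequence into clean_sequence and passing the result to gc_fraction and check_cds allows a comparison with the GC content and Gene Length fields. The gene begins with GTG while the protein begins with M, which is consistent with GTG acting as an initiation codon under translation table 11.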