Gene Noca_4784 details

Gene Information

Locus tag: Noca_4784
Symbol:
ID: 4595372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008697
Strand:
Start bp: 98323
End bp: 102420
Gene Length: 4098 bp
Protein Length: 1365 aa
Translation table: 11
GC content: 65%
IMG OID: 639772571
Product: hypothetical protein
Protein accession: YP_919231
Protein GI: 119714089
COG category:
COG ID:
TIGRFAM ID:
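The coordinate and length fields above are tied together by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; both assumptions are consistent with the figures in this record, but the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(98323, 102420)
print(length, protein_length_aa(length))  # prints: 4098 1365
```

Both values match the Gene Length and Protein Length fields listed in the record.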


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0358899
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCGCG CCACCACCCG AGCCCGAGCC GGGTTCATCG CTCCGAGCGC GACCGAACTG 
CACCGCGCCT GGCTTGAACT CGTCGACACC GAAGGCCCCT TCCTCGCGAT CCCGCCGCTC
AAGCGGGTCT GGCCGCAAGG CATGCCCAGC CTCAGCGACG ACCGTAAAGG AGCGCTGGCC
GATGCCCGTA AGGAATTCGA GGCGGCATGG GAGAAGCTCG ACCGCGCTCC CGACAACGAC
GCCGTCCTCG ACGCCTACCG CGTCGCCCGC GACAAGTGGG TCGAGACAGT CCTGCGCGAC
GTCGCGGGCT GGGCCGAATC CTTGTCGTGG GGCGAATTGC CCGGCGTCCA GGCTCGATCG
CCCAACCACG CCGTCACGGT CACAGCCCAG GCCTCACTTG GCGGCGCCGA CGGGATCGGC
GCGCTCGTGC ATGTCATTGA CCCGACCGAC TCCCTGCGCC AGACGCCCAA CGACCTGTGG
GCCGCAACGC CCGTCGACCG CACCGAGGCG CTACTACGCG AGAACAGGAT CCAGCTCGGC
ATCGTCACCG ACGGACGCTG GTGGGGCTTG GTCTGCGCCC GCGACGGCGC GATGGCCGCC
TCCGGTGTGG TCGACGCCCT GACCTGGATC GAGGAGCCCC GCACCCGCGA CGCCTTCCTC
GCCCTGATCG GACGCCAGCA CATCATCGGC GGCGACCCAG CCGAGCGGAT CCCGGTGCTG
TTTGAGGAAT CGGTCGCCGC GGCCGAGGAG ATCACCGAAG CCCTCGGGTC ACAGGTGCGC
AGCGCAGTCG AACTGCTCAT CCAGTCGTTC TCCGAGTCCG CCGCCGACGC CAGCAGACGC
GGCCTGCCGG ACCCGCTCCC GACCCTGACA CACGCCACGT ACGAAGCCGC TGTCACCGTG
ATGATGCGCG CAGTATTCCT CCTTTTCGCC GAGGAGCGAG GACTGCTGCC GTCGGGCGAG
CTCTTCGACC AGGGCTACGG CATCGCCCGC GAACTCGACC GCCTCATCGC CCGTGAGACC
GAAGATAGCG AAGAGGCTCT CGACGCCACC TCCCTAACCT GGCATCGACT TCTGGCCACC
AGCCAAGCCC TGTTCAGCGG CGCTTCCTTC GAGAGCTTGC GCATGCCCGC CTACGGCGGT
TCCCTGTTCG AGCCCGGGCG GTTTCCGTTC CTGACGGCCA CCAACGAGGG GGGCACTCTC
GCGGTGACCG TCTCAGACCG GGTCATGCTG CATGTGCTGC GGTCGGTCCA GATCGCCGAC
ATCAAGGGTG AAGCTCGGCG CATTTCCTTC CGGGACATCG ACGTCGAGCA GATCGGCTAC
ATCTACGAGG GTCTGCTCGG CTACACCGCT GCCAAAGTTG GGGAGACTTA TGTCGGGCTG
AAGGGCACCA CCGGAGTCGA GCCAGAGATT CCGCTGGTAA TCCTCGAAGA ACTCGCCGAG
GCCAACCCCG ACGCCAAGAA GCTCGCTGCC GCGATCCGCT CGTGGATCGA GGAAGTCCAA
CCTTCGGCGA GGGCGTCGTC TGCGGCAGCG ATTGCCAAGG CGATCGGTGC CGCCGTCGAC
CCGAGTGTGC TCAGCGCCCT CACCCAAGCG GTCGGCGATG ACCCTGAGCT GAGGGACCGC
GTCCTGCCGT GGCTCGGCCT CGTCCGGCTC GACCTGCGCA AGCGCCCCTT CGTCGTCCTG
CCCGGTGGCC TGATGGTCAA AGAGACGCCG TCGCGGAAGA ACGCCGGCGC CCACTACACG
CCCAAGTCAC TGGCTGAGGA AGTCGTCCTG CACGCACTCC AGCCGCTCTG CTACTCGCCT
GGCCCGCACC AGACGGCGGA CGAGACTGAG TGGAAGCTCA AGTCCTCCGA TGAGTTGCTC
GACCTCAAGG TCGCCGACAT CGCCTGCGGC TCGGGAGCCT TCCTCGTCGC CGCCGCCCGC
TACCTCGCCG ATCGCCTAGT CGAAGCGTGG ATCTCCGAAG ACCCGGTCAA CGCAGGACGT
AAAGACCTTC ACACTCGAGC AGTCCGACAG GTCGTCGCCA ACTGCCTCTA TGGCGCCGAC
ATCAACGGCA TGGCAGTCGA GATGTGCAAG CTCTCGCTCT GGCTGGTCTC GTTGGACCGC
GACCTGCCGT TCTCATTCGT CGACGACAAG GTCTTCCTGG GAAACTCGCT CCTCGGGCTG
ACGAGCCTGG ATCAACTCCG ACGCATGCAC ATCGACCCGA AGAACGCACG TGCCGACGAA
ACGTTCCACG GCCAGCTCAT CGACGTGAAC GCGATCATCC AGCGGATTGT TGAGTTGCGA
CGCCGACTCC TGAGCGAGAT CGATGAGGCT GACCCGAATC GGACGAGCGC GGCCAAGCGT
CGTCAGCTCC GACGATTGCG TGAGATCACC TCAGATCTCC GCAAACTCGC GGACGGCATC
GTGGCTGCGG GGCTGCCCTA TGGCGGCAAG TCCGGGAAGG CCCTAGACGA GGCCTACGAG
AATCTGCGTA TTGCCGTCCG TGCCGCGTAC CCGGAGGATG GCGCAGGGGA CTCGTCGTTC
CTTACGGCTG TCATTGATGC AGGGCTGGTG CCGACCGTTC CTACCGACTA TGAGCGATGG
CAGCCTCTGC ATTGGATCCT CGAGGCGCCC GATGTGTTGG TCGAGAATGG CGGTTTCGAC
GCGGTGGTCG GAAACCCACC GTTCCTCGGT GGCCAAAAGC TTTCTGGGGC GATGGGCACG
CCCGTTCGCG ATTGGTTGGT GGATGTCCTA GCCGATGGTC GTCGGGGGAG TGCGGACCTT
GTCGCCTACT TCTTCCTCCG GGCTGTCGGC CTGCTCTCGC ACGGCGGTGG ACTGGGCCTA
ATCGCCACCA ACACGGTTGC GCAAGGTGAT AGTCGTGCAG TTGGTCTGGA CGCCATGGTC
GCCCGCGGCT TCACGATCAC ACGGGCAATC CAGTCGCGCT CTTGGCCTGC GTCGAGTGCG
AACCTGGAAT ATGCCGCCGT CTGGGGGACT ATCGCCTCCG TACCGGAAAC CGTGCCTCGC
ATTGCAGATG GCATTCCGGT ACGCCGCATC AGCACGCTGC TTGAGCCAGC TGGGCGTGCC
GAGGGAAGCC CGATGCGCTT GATTGAGAAT GCCGGGATCG CGTTCCAGGG ATGCATCGTG
CTTGGCATGG GGTTCGTGCT TGACCCCGAG GAGGCGATGC GTTGGATTGC CGAGGATCCC
CAGAACGCCG AAGTCTTGTT TCCCTATCTG AACGGCGAAG ACCTCAATCA GCGGCCAGAC
GGTTCTGGAT CACGCTGGGC GATCGACTTT AACGACTGGC CCGAGGAGCG TGCTCGCAAG
TTCCCATTGC CGTATGAACG AGTCGCGGAG CGGGTGCGTC CCGAACGGCA GAGAATGAAG
CCGAATGGCG AGTTCGCGCT GAGGAAGCCG TTGCCCGAGA GGTGGTGGCA GTACGCCGAG
AAGCGGCCAG CTATGCGAAA GGCGATCGCC AACCTTGACG AGGTACTGGC TGTCACCCGC
ATCAGCAAGC ACGCTGTTGT TGTGCGGGTG CCAACGGGAC AGGTGATCAA CGACGGCATC
GTTGCGTTCA TGACCGACTC GTATTCGACG CAGGCTGTCC TCTCGTCGAG CTTGCATCTG
TCATGGGTGA TCAAGAACGG GTCGTCCTTC GAAACTCGAA TCATCTACAC GCAATCCGAT
GCGTTCGATA CATTCCCCCG CCCGAAGCCG ACAGTCCGCC TTGAGTCCGT AGGAAGAACC
CTGGACGAGG AACGGCGGGA GATCATGCTC CGCCGCGACC TCGGCCTGAC GAAGCTCTAC
AACCTGGTGA ACGACCCGGA GCTTCAAGCT GATGTGGATG TCACGCGCAT GCGCGAGATC
CATGTCGAGG TGGACGAGGC GATGATGGCG GCGTACGGCT GGGGGGACGT CCCTCTCCGC
CACGGCTTCC ATACCTACCG CCAGGTCGAG CGCTGGACGG CCTCACCTCC AGCACGCGTC
GAGATCCTCG ACCGTCTCCT GGAGGAGAAC CACAAGCGCG CGGCGGCGGA GTCTACGAGC
GACAAGACGA AGGGTCAGCC GCCCGCAGTG GCAGATGAAA GCCAGGCCAG CCTCTTCGCG
GACAGCGTCG ACGAGTAG
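
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any per-base statistic such as GC content can be computed. The sketch below shows one way to do that; the helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed percentage describes that line alone and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCCCGCG CCACCACCCG AGCCCGAGCC GGGTTCATCG CTCCGAGCGC GACCGAACTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full sequence text to `clean_sequence` and then to `gc_percent` gives a figure that can be compared with the GC content field.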
 
Protein sequence
MARATTRARA GFIAPSATEL HRAWLELVDT EGPFLAIPPL KRVWPQGMPS LSDDRKGALA 
DARKEFEAAW EKLDRAPDND AVLDAYRVAR DKWVETVLRD VAGWAESLSW GELPGVQARS
PNHAVTVTAQ ASLGGADGIG ALVHVIDPTD SLRQTPNDLW AATPVDRTEA LLRENRIQLG
IVTDGRWWGL VCARDGAMAA SGVVDALTWI EEPRTRDAFL ALIGRQHIIG GDPAERIPVL
FEESVAAAEE ITEALGSQVR SAVELLIQSF SESAADASRR GLPDPLPTLT HATYEAAVTV
MMRAVFLLFA EERGLLPSGE LFDQGYGIAR ELDRLIARET EDSEEALDAT SLTWHRLLAT
SQALFSGASF ESLRMPAYGG SLFEPGRFPF LTATNEGGTL AVTVSDRVML HVLRSVQIAD
IKGEARRISF RDIDVEQIGY IYEGLLGYTA AKVGETYVGL KGTTGVEPEI PLVILEELAE
ANPDAKKLAA AIRSWIEEVQ PSARASSAAA IAKAIGAAVD PSVLSALTQA VGDDPELRDR
VLPWLGLVRL DLRKRPFVVL PGGLMVKETP SRKNAGAHYT PKSLAEEVVL HALQPLCYSP
GPHQTADETE WKLKSSDELL DLKVADIACG SGAFLVAAAR YLADRLVEAW ISEDPVNAGR
KDLHTRAVRQ VVANCLYGAD INGMAVEMCK LSLWLVSLDR DLPFSFVDDK VFLGNSLLGL
TSLDQLRRMH IDPKNARADE TFHGQLIDVN AIIQRIVELR RRLLSEIDEA DPNRTSAAKR
RQLRRLREIT SDLRKLADGI VAAGLPYGGK SGKALDEAYE NLRIAVRAAY PEDGAGDSSF
LTAVIDAGLV PTVPTDYERW QPLHWILEAP DVLVENGGFD AVVGNPPFLG GQKLSGAMGT
PVRDWLVDVL ADGRRGSADL VAYFFLRAVG LLSHGGGLGL IATNTVAQGD SRAVGLDAMV
ARGFTITRAI QSRSWPASSA NLEYAAVWGT IASVPETVPR IADGIPVRRI STLLEPAGRA
EGSPMRLIEN AGIAFQGCIV LGMGFVLDPE EAMRWIAEDP QNAEVLFPYL NGEDLNQRPD
GSGSRWAIDF NDWPEERARK FPLPYERVAE RVRPERQRMK PNGEFALRKP LPERWWQYAE
KRPAMRKAIA NLDEVLAVTR ISKHAVVVRV PTGQVINDGI VAFMTDSYST QAVLSSSLHL
SWVIKNGSSF ETRIIYTQSD AFDTFPRPKP TVRLESVGRT LDEERREIML RRDLGLTKLY
NLVNDPELQA DVDVTRMREI HVEVDEAMMA AYGWGDVPLR HGFHTYRQVE RWTASPPARV
EILDRLLEEN HKRAAAESTS DKTKGQPPAV ADESQASLFA DSVDE
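
The protein sequence is the translation of the gene sequence under translation table 11. As a rough check, the sketch below translates DNA with the standard codon assignments, which table 11 shares; the two tables differ in which codons may serve as start codons. It does not handle alternative start codons such as GTG or TTG, which is not an issue here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final line of the gene sequence from the record.
print(translate("ATGGCCCGCG CCACCACCCG AGCCCGAGCC"))  # prints: MARATTRARA
print(translate("GACAGCGTCG ACGAGTAG"))               # prints: DSVDE
```

The two outputs agree with the first ten and last five residues of the protein sequence above.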