Gene Noca_4737 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4737 
Symbol 
ID4595472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008697 
Strand
Start bp35200 
End bp37470 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content71% 
IMG OID639772526 
Producthypothetical protein 
Protein accessionYP_919186 
Protein GI119714044 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0599704 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTTGCCGG TGACTGTACC GGCGTCGCGT GGCGATGCCG CACACCTTGA TGCAGGTCGG 
CGTGTCTCGC CGGCGCTGTT GGCGCTGGAT GAGCACCTGT TCAACCCTGA CGCGTCGCAG
TGGCTGGTGA TCGACTCCGG CAACACGGCT GTTCCCCGGA CCGTGGTGCC GTCGCTGGTC
GACGGGCTCG AGCTGATCGG CAAGGGCCGT ATCGCCGCCC TGGTCGGGCA CCTGCGCGAC
GGCGTCGGCG TGGTCGACGT CGACGTGCCC GGCGAGTTCG GTGACTTCCT CGCGGCCGAG
GTCGCCGACT GGCTGTCCCG CCGCGACTGC TGGGTGCTGG AGCGGCCCTC CGGCGGCGCG
AAGGGTCGCT GGCACATCTT CTTCGCCCAC TCCGACTTCC ACTACGCCCC CGCCGCTGCC
CGGGCCGGGT TCGCGGCTGC GGTCAGCGGC TTCCTCACAG CCCTGGCCGA GGACGTGAAG
GTCCCGCGCG GTGAGCTCGA CCTGCGCGAC GCCGTGCGTC CCCTTTCCTC ACCTCACCGG
TTCGGTGCGG TGACCAGGCC GAAGGGCGAG CTCCGTGAGG CGCTGCGCGG CCTGAAGCGA
GTGCTCCCGG ACCCGCCCTC GCCGTCGCCG CTGCGGCCCC GCGCCAAGGT GAAGCCCACC
AACGCTGGCC ATAAGTCCAC TGCGACCGGC TCGGGCCTGG TGGTGCCGTT GGCGTTGCAG
CGGTGGAAGC GCCAGCTGCG CACTGAGTGG CGCAATTACC TGCTGACCGG GGAAATCCCG
GCCGGGTCGT GGGCCGCGGG CGCGACGAAG ACCCGCGCCG CGGTCGAGGT CGACCGATCC
CTCGTCGAGG CGGCCTGCAC CCGGGAGATG GTGTGGGCGA TCGGCGACCC GGAGATGGCC
TGGCGGATCA TCCGTGAGTC GCATCCGACC GCGATGACCA AGGCCAAGCA CCAGGGCTAC
TCGTGGTGGC TCGGCTACGT GTGGAACGAC CTCGTCCGCT CGGCCAGCGA GTTCAACACC
ACCAGCGAGA AGCCCAGGCA GGTCGAGGCG CCACCGGTCG AGGTCGTCGA GGCGGTAGCG
GCTGCCCGCG CCGAGCTAGG TCGTCTGATG TGGTCTGTCC CGGACCGCCA GCGGGCGGCG
CTGCTGCTTG TCGGGCACCA CCTCCTTGAC CGCGTGCTGC GCAAGCGAAA CCTGCGTGTG
CCCTGCCCCG AGCGGGACCT GCTGCTCGAC ACCGGGCTGG GCGACCGCAA GACCGTCCGC
GCCGCCCTGG CCCGCCTCAA CGGCCGCCTC GGGACTCTCC ACACCGACTG CCTCTCCCCC
GTCGAGCGGG ACTCCACGAG CTACGAGTTT GAAATCAACC AGGCCCCGGA AGGGGAGGGA
CGGCAAATCC CCCCACCTGG GTTTGACCCA CCCCCCGCCC CCCGCGGCCT CTGGGCGACC
CTCCCCCGCT CCAGCCACAG CCTCTGGCGC ACGCTCCTGA CCTGCTCGAC CCCGCTGGAG
CTGGGGGATC TGGTGGTGAA GGCAGGGCTG GTGAAGGCTG CAGGTGATGA GGTGTCGAAG
TCTCAGAGGT CCACGGCGAA GGCGGCGCTG GTCGCGCTGA GCAAGGCTGG GATGGTTCGG
GTTGACGAGA ATGGATGCTG GCAGGCAGCC ACGCGGCCCC GGTCGGTTCA GGTCGAGCAG
GACGCGGCCG CGGCGTACGC CCGTCAACTG GAGACGATCG AGGCAGAACG AGCCGCCTAC
CGCGCCGGCA CGACGTCGAG CTGGACCGCA GGCCGGGCGC GCGCCATCAA GGCACAGAGG
GCCAAGGAGA AGGCCTGGTG GGACAACCTC TCCCCCGCCG CCCGCGCCGA ACGTGCCGCA
GCGAAGCGCC TCGAGTTCGA CCAGATGTCG ATCAGCCAGC AGGCCGCGCT GAAGTCCCGT
CTCGCCGAGC GACGCATGCG CGCTGGTATC GACGAGCTCG AGACCTACCA GACCTGGCTA
CGTAGCCTCC CGGCCGACGA GTACGTCGCC CGCAGCCTGG AGCGCAAACA ACGTTTCCAG
GCCCTCTCCC CCGCCGAACG AGGCGCCTCC GTCGCCGCCT GGGATCGCCA CCGGCTCCGC
TACGGCCTCA CCGCCCAGCG CCTCGCCACA CCCGCACTCG ACGCGCGGAC AGCGACCCCT
GACGTCGAGC AGGCAGCACT CCTGCCCGAC GGAGTCGCCG CACGCGATGC AACTTTCCTC
GAGCGCCAGG GCAACCTTCT CGACGACGTC GAACGCCAGG CCGCGGGCTA G
 
Protein sequence
MLPVTVPASR GDAAHLDAGR RVSPALLALD EHLFNPDASQ WLVIDSGNTA VPRTVVPSLV 
DGLELIGKGR IAALVGHLRD GVGVVDVDVP GEFGDFLAAE VADWLSRRDC WVLERPSGGA
KGRWHIFFAH SDFHYAPAAA RAGFAAAVSG FLTALAEDVK VPRGELDLRD AVRPLSSPHR
FGAVTRPKGE LREALRGLKR VLPDPPSPSP LRPRAKVKPT NAGHKSTATG SGLVVPLALQ
RWKRQLRTEW RNYLLTGEIP AGSWAAGATK TRAAVEVDRS LVEAACTREM VWAIGDPEMA
WRIIRESHPT AMTKAKHQGY SWWLGYVWND LVRSASEFNT TSEKPRQVEA PPVEVVEAVA
AARAELGRLM WSVPDRQRAA LLLVGHHLLD RVLRKRNLRV PCPERDLLLD TGLGDRKTVR
AALARLNGRL GTLHTDCLSP VERDSTSYEF EINQAPEGEG RQIPPPGFDP PPAPRGLWAT
LPRSSHSLWR TLLTCSTPLE LGDLVVKAGL VKAAGDEVSK SQRSTAKAAL VALSKAGMVR
VDENGCWQAA TRPRSVQVEQ DAAAAYARQL ETIEAERAAY RAGTTSSWTA GRARAIKAQR
AKEKAWWDNL SPAARAERAA AKRLEFDQMS ISQQAALKSR LAERRMRAGI DELETYQTWL
RSLPADEYVA RSLERKQRFQ ALSPAERGAS VAAWDRHRLR YGLTAQRLAT PALDARTATP
DVEQAALLPD GVAARDATFL ERQGNLLDDV ERQAAG