Gene Noca_1778 details

Gene Information

Locus tag: Noca_1778
Symbol:
ID: 4597690
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 1887550
End bp: 1889430
Gene Length: 1881 bp
Protein Length: 626 aa
Translation table: 11
GC content: 70%
IMG OID: 639776378
Product: hypothetical protein
Protein accession: YP_922978
Protein GI: 119716013
COG category:
COG ID:
TIGRFAM ID:
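
The coordinate and length fields in this table are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1887550
END_BP = 1889430


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1881 626
```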


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.294747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGCGCC GAGCTGCTGC CCCCCTCCCG GTCTCCCCCC CGGCCCCCAT CCCGGCCTGC 
GTGGCCGTCC TCCTGGTCGC CGCGGCGACC GTGCTGACGA TGCTCGCCGG AGCACCACCC
GCCGGCGCCC ACGAGGAGCG GCCCGCGCAG TTCCCCGACG GCACCGGGCA CCGGCCGTCG
TTCCTCGGCT TCGACAACCC CCGCCAGCGG GTCGTGTGCG GGTCCGACAG CGGCCGGCTG
ATCGCGGCGC TTCCCGAGGG CGCGGTCAAG GACCGCAACG AGCGGCTGCT CGACTCGTGC
CGGTACCGGT CCATCCAGGA CGCGATCGAC TCGGTCCGCA AGCGGCAGAC CTCCGTGTAC
GTGCTGCCGG GGACCTACCA CGAGGCGAAG TGGGCGCGCG CGGAGCGCAG CGACTACTGC
TCGCACCTGC AGACCTCGTC GACCTCGCCG CTGCCGTTCT CGGAGTACAT CGGCAGCCTG
TCCTCGCCCG ACCCCGGTGC CGAGCAGGCC GCCGCGGACT CCGGCGAGTC CAACCCGATC
GCGTTGTCCT ACGCCGACCA GCGGCGCTGC GCCCACAACC TGAACCTGAT CGCGGTCTTC
GGTGACCGGA CACCGGGCAA CGACAGCATC CGCTGCGACA GCCGGTTCTG CGGCCTCCAG
CTGGTCGGCA CCGGGCAGAC ACCGGCCGAC GTGGTCGTCG ACAACCGGTT CGCCAAGCTC
AACGCGATCC GCGCGGACCG GGCCGGCGGC TTCTACCTGA GCAACATGAC CTTCCAGCAG
GCCGAGTTCA ACGCGATCTA CGTCCTGGAG ACCGACGGCT TCGTCATCGA CCGGGTGGTC
GCCCGCGGCA ACGACGAGTA CGGCGTCCTC GCGTTCGCGA GCGACCACGG CCTGATCGAG
GACTCCGAGG CGTACTACAA CGGCGACTCC GGCATCTACC CGGGCTCCGG ATCCGACCTC
AACGCCGACA ACACGGAGTT CGAGGCGACC CGCTACGCGA TCGAGATCCG CGGCAACAAC
AGCCACGACA ACACGCTGGG CTACTCCGGC ACGGCCGGCA ACTCGATCTG GGCCCACGAC
AACGACTTCC ACGACAACGC CACCGGCATC GCGACCGACT CGCTGTTCCC CGGACACCCG
GGCCTGCCGC AGGACCACGC GCGCTGGAAC CGCAACCGGA TCTACTCCAA CAACTCCAAC
TGGTACACCG AGTTCGTCGA CACCGGCGTC TGCGACAAGC CGATGGAGCA GCGCGGCTAC
ATGGACGGCA CGGTCTGCCC GGTGGTGCCG ACGCCGGTCG GCACCGGCGT CCTGATCGCG
GGCGGCAACT ACGACTCGAC CGACCACAAC TGGATCTACG ACAACTGGCG CTACGGCACC
ATGCAGTTCT GGGTGCCCGC CCCCCTGCGC GACGACTACG ACCCGTCGCA CCTCTACGAC
ACGTCGAACC ACAACCACAC CTTCGAGAAC CAGATGGGCA TCGACCCGCA GGGCCACGAG
CAGCCGAACG GGATGGACCA CTGGTGGGAC GACCAGGGCG TGGGCAACTG CTGGGAGGAC
AACCACTACG GCTCGGCCGG CCAGACCGAC AACTTCACGG TTCCGCCGCC CTCCTGCGCC
GACGGCGGCT CGGTCTTCCT GCCCGGCGCG ACGGTGAAGG ACGCCGGCTT CCTCAGCTGC
AGCCAGTACG ACCGCAGCGA CCCGACCTGG AAGCACCCGC CCGCGTGCGA GTGGTTCGAC
AGCCCCAGCA AGCCCGCGCG GGCCGTGGCC GCTCCGGGTC GGCCCGCCTC CGGCACGGTG
CTGGTGGCCC CGATCGCGAT GACGGCCGGC GCGCTCGCGT TGGTCCTGGG CCTGCGCCGC
CGGTGGTCGC GTGCGGACTA G
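
The GC content figure in the Gene Information table can be recomputed from a gene sequence laid out in this way. The following is a minimal sketch, assuming the sequence is pasted as a string with the spaces and line breaks of the 10-base block layout left in; `gc_content` is an illustrative helper, not part of the record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First two 10-base blocks of the gene sequence, used only as a short demo;
# pass the full sequence to compute the gene-wide value.
print(gc_content("ATGACGCGCC GAGCTGCTGC"))
```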
 
Protein sequence
MTRRAAAPLP VSPPAPIPAC VAVLLVAAAT VLTMLAGAPP AGAHEERPAQ FPDGTGHRPS 
FLGFDNPRQR VVCGSDSGRL IAALPEGAVK DRNERLLDSC RYRSIQDAID SVRKRQTSVY
VLPGTYHEAK WARAERSDYC SHLQTSSTSP LPFSEYIGSL SSPDPGAEQA AADSGESNPI
ALSYADQRRC AHNLNLIAVF GDRTPGNDSI RCDSRFCGLQ LVGTGQTPAD VVVDNRFAKL
NAIRADRAGG FYLSNMTFQQ AEFNAIYVLE TDGFVIDRVV ARGNDEYGVL AFASDHGLIE
DSEAYYNGDS GIYPGSGSDL NADNTEFEAT RYAIEIRGNN SHDNTLGYSG TAGNSIWAHD
NDFHDNATGI ATDSLFPGHP GLPQDHARWN RNRIYSNNSN WYTEFVDTGV CDKPMEQRGY
MDGTVCPVVP TPVGTGVLIA GGNYDSTDHN WIYDNWRYGT MQFWVPAPLR DDYDPSHLYD
TSNHNHTFEN QMGIDPQGHE QPNGMDHWWD DQGVGNCWED NHYGSAGQTD NFTVPPPSCA
DGGSVFLPGA TVKDAGFLSC SQYDRSDPTW KHPPACEWFD SPSKPARAVA APGRPASGTV
LVAPIAMTAG ALALVLGLRR RWSRAD
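
The protein sequence corresponds to translating the gene sequence codon by codon under translation table 11. The sketch below is a simplified illustration: it uses the codon-to-amino-acid assignments of table 11 (which match the standard code), reads from the first base, and stops at the first stop codon. It does not handle the alternative start codons that table 11 permits, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string from its first base up to the first stop codon.

    Whitespace is ignored; a codon containing anything other than
    A, C, G or T raises KeyError.
    """
    cleaned = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence.
print(translate("ATGACGCGCC GAGCTGCTGC CCCCCTCCCG"))  # prints: MTRRAAAPLP
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (spaces removed) is a quick consistency check on the record.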