Gene Noca_4898 details

Gene Information

Locus tag: Noca_4898
Symbol:
ID: 4595268
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008697
Strand:
Start bp: 230236
End bp: 231711
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 72%
IMG OID: 639772683
Product: ATPase domain-containing protein
Protein accession: YP_919343
Protein GI: 119714201
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
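
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that contributes no residue.

```python
# Values copied from the Gene Information fields above.
start_bp = 230236
end_bp = 231711

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: one residue per 3 bp codon, minus the terminating stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1476 491
```

Both results agree with the stated gene length (1476 bp) and protein length (491 aa).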


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value: 0.246184
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.165682
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCACCA CGCGTGGAGC CGCGGTCGCC GCCTGGTGGC CGTGGCTCGC GCTGCCCGCT 
CTGTCGACGG GCCTCGTCGG CGCCGGCCTG GCCCTCGGCC TGACAGACCC CGACGGCAAC
TGGATCAAGG GTGAGGCCTG GACAACCGTT CCGCTGGCCT TCGGGTTCAC CACGGTCGCG
GCCGGGATCT GGTCGACTCG CCCGCACCCC GTTGGCCTAC GCCGTCTCGC TGTGCTCTAC
ACCGTGGTCG GGCTCGGTGC CGCGTGTGCG CTCCCGGCCC ACGGCTGGGC CCGCAGCGAC
GTCGCCGGCG ATGCGCTTGC GGCCTGGGTC TCGAACTGGG TGTGGTCGCT CGGTGCTGCG
CCGCTCCTGG GGATCGGGCT GCTGCTCTAT CCTGACGGCC GCCTGCCCGG TGCCCGGTGG
TGGCCCGCTG CCGGGATCGG CCTCACCGGT ATGGCGGCGC TCACCGCGTC CGCGGCGCTC
AGACCGGGTG CGCTCGAGGA CCACCCGAGG TTCGACAACC CGGCCGGCTT CGGGAACCGG
GGCTTCTGGG ATGCGGCCGG GGGTGTCGGC TTCGTGCTCC TCATGTCGGG CGCGGCACTC
GGCGTGGCCG CGCTGGTCGT CAAGTTCCGG CGAGCTCCGG CCGGCAGTGA CATCCGCGGC
CAGATCGGTG GCTTCATGCT GGCTGGTTCG CTGATCGTCG TTGTCGCTTC CATCCCTGAA
ACCGACGACC TGGGCATCAC GCTGCTCGGG CTGGTCACCG TGACCGCCCT GCCCGTCACG
GTCGGCACCG CCGTCATCCG CCACTCCCTT CTCGACCAGC GCGCCGACGT CGAGAAGCTC
AACCGGCGGG TCCGGGACCT CTCGACCTCG CGCCGGCTCA TCGTCAACGA GCGCGAGAAG
GAGCGGGTCG CGCTGCGGCG GGACCTCCAC GATGGGCTCG GCCCCTCACT TGCCGCCATC
GGTCTGGGGC TACGGCAGCT GGAGCAGAAG ACCGGCGGCG GCGACGGCGT ACGCGAGATG
GCCGACGAGG TCCAACGCGC CGTCGCGGAG GTACGCCGGA TCTGCGACGG CCTGCGCCCC
GCAGCTCTCA ACGAGCTCGG CCTGGCCGGC GCGCTGACAG AGTCGATCGA GCCCTTGCAG
CGCTTCGGCC CACGGATCAC CCTCATCATC GAAGAGCTTC CACGACTGAG CCCGGCAGTC
GAGGTCGCCG CGTTCCGGAT CGTGATGGAG GCGGTCACGA ACGCCGTACG CCACGCCGAT
GCCCAGCACG TCCAGGTCAA CCTCGGGTAC GCCGACGGCG TCACGGCGCA GGTGACTGAT
GACGGCCGGG GTATCGCCGA AGACCGTGTT CCCGGAGTCG GTCTGCGCGG CATGTCGGAC
CGTGCCGACG AGGTCGGCGG CCGACTCATG GTGAGCGCGG CGGTCCCGAC CGGCACATCC
GTCCACGCCT GGCTCCCGGC GGCCGACCAT GACTGA
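
A GC content figure like the one listed under Gene Information can be recomputed from a sequence printed in this 10-base block layout. The following is a minimal sketch, not the method the database itself uses: `gc_content` is a hypothetical helper written for this example, and it assumes the sequence contains only A, C, G and T plus whitespace.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (the spaces and line breaks of the block layout) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above, spaces included.
first_line = "GTGAGCACCA CGCGTGGAGC CGCGGTCGCC GCCTGGTGGC CGTGGCTCGC GCTGCCCGCT"
print(f"{gc_content(first_line):.0%}")
```

Passing the whole gene sequence as a single string gives a value that can be compared with the GC content field.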
 
Protein sequence
MSTTRGAAVA AWWPWLALPA LSTGLVGAGL ALGLTDPDGN WIKGEAWTTV PLAFGFTTVA 
AGIWSTRPHP VGLRRLAVLY TVVGLGAACA LPAHGWARSD VAGDALAAWV SNWVWSLGAA
PLLGIGLLLY PDGRLPGARW WPAAGIGLTG MAALTASAAL RPGALEDHPR FDNPAGFGNR
GFWDAAGGVG FVLLMSGAAL GVAALVVKFR RAPAGSDIRG QIGGFMLAGS LIVVVASIPE
TDDLGITLLG LVTVTALPVT VGTAVIRHSL LDQRADVEKL NRRVRDLSTS RRLIVNEREK
ERVALRRDLH DGLGPSLAAI GLGLRQLEQK TGGGDGVREM ADEVQRAVAE VRRICDGLRP
AALNELGLAG ALTESIEPLQ RFGPRITLII EELPRLSPAV EVAAFRIVME AVTNAVRHAD
AQHVQVNLGY ADGVTAQVTD DGRGIAEDRV PGVGLRGMSD RADEVGGRLM VSAAVPTGTS
VHAWLPAADH D
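
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG can act as a start codon. The sketch below illustrates that relationship in plain Python. It is an assumption-laden example rather than the database's own procedure: `translate` and the constant names are hypothetical, the codon assignments are the standard ones (which table 11 shares), and the start codon set is the one listed for NCBI translation table 11 as best recalled here.

```python
from itertools import product

BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumption: start codons of translation table 11; any of them is read as M
# when it is the first codon of the coding sequence.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGAGCACCA CGCGTGGAGC CGCGGTCGCC"))  # MSTTRGAAVA
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (GCC GAC CAT GAC TGA) translate to ADHD followed by a stop, matching the end of the protein.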