Gene Noca_1933 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1933 
Symbol 
ID4599838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2062938 
End bp2064545 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content74% 
IMG OID639776531 
ProductATPase domain-containing protein 
Protein accessionYP_923130 
Protein GI119716165 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.493372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAGGG TGGCGACGAG GCTCGCGGGG TTGCACCGCC GGCTGCTGTT CGCCGACGAC 
CCGGATCCGG AGCTGCTCCA GGGTGTCTTC GCCGCCCTGT TCCTCCTCGA CTTCGTGCTG
CGCGCGGCCG GCGGCTCGTC GCCGGCGCTG GCTTCTTGGC CGGTCCTGGG GCTGCTCGTC
ATGGCGCTCG CCTCGGCGGC CGCGATCGCG GTGCCGTGGG AGCGGGCGCC CGTCTGGGCG
GTCGGCATCC TCCCGGCCGT CGACATCCTC GCCCTCGGCT TCTCCCGGAT GGACCCCGCC
GGCGGCGCGA CCGGGACGCT CGCAGTGGTG CCGGCGCTGT GGCTCGGCCG CCAGTACGGT
CGGCGCGGCG CCGTCGTCGT CCTCCTGTCG ACCGCGTTCC TGCTCGCCGG GCCGGGGCTG
CTCTACCTGG GGGCGTCGGG GGCGAACCTG ACCCGCACCG TGATGATCCC GCTGGTCGCC
ACCTGGGCCG CCCTGGCCAT CTCCTACGCG CTCGAGCGGG TCCGGGCCGG CATCGAGGTG
ATCGAGCACC AGCGTCGGGT CTCCCAGGCG ATCTTCGACA CCGTCGACGT GGGCCTGGTG
CTCCTGGACG AGTCAGGGGC CTACCAGGGG ATCAACCGCC GGCACGCCGA CTTCATCCGC
CTCGCCTTCC CCGAGGGGCA CCAGGGGCGA GCAGGCCAGC TCGGCGCGGT GTACGACGCC
GACGGCACCC GGGAGCTGGC CGCGGAGGAG ATGCCGACGC ACCGCGCGAG CCTGGGCGAG
CAGTTCGAGG ACGTGCGGAT CTGGGTCGGC CCGGACCCGC TGACCCGCCG GGCGCTGTCG
GTCTCCGCGC GCCGGGTGGA GGACGAGGCC GGGGCCCTCG CCGGCGCCTC GCTGGCCTAC
AAGGACGTCA CCGACTTCAT GCGGGCGCTG CGGGTCAAGG ACGAGTTCGT CGCCTCGGTC
TCCCACGAGC TGCGCACCCC GCTGACCTCC ATCGTCGGCT ACACGCAGAT GCTGCTCGAG
CGCGACGACC TGCCGGCCGA CGTCGTCGCC CAGCTCGAGG TGATGGCCCG CAACAGTGGG
CGGCTGCACC GGCTGGTCGC CGACCTGCTG CACACCGCCC GCCTCGACGA GGGCGGGGCG
CCCCTGGTGC GGGCGCCCGT CGACCTGGCT GCGGTCGTGC GGGACTGCGT GGAGGCCGCC
GCCCCGGCGG CGCGTCGCTC TGGCGTCGAG CTCGTCCTCG AGCTGCCGCC GACGCTGGTG
CTGCGGGCCG ATCAGGAGCG GCTGGCGCAG GTCGTCGACA ACCTCGTCTC GAACGCCGTG
AAGTACACCA GCGCCGGTGG CTGCGTGCAG GTGACCCTGC TCCTCGACGG GGACCGGGCC
GAGCTGTGCG TCGCCGACAC CGGGATCGGG ATCGCCGCGG CCGACCGGGA CCGCCTGTTC
ACCCGCTTCT TCCGCGCCCG CCAGGCCGAG GAGCGCTCGA TCCAGGGTGT CGGGCTCGGC
TTGAGCATCA CCAAGGCGAT CGTCGAGAGC CACGGTGGCC GGATCGAGGT CGAGAGCGAG
CTCGGCAGGG GCAGCGTGTT CCGCGTGCGG CTGCCCCTGG ACGCCTGA
 
Protein sequence
MGRVATRLAG LHRRLLFADD PDPELLQGVF AALFLLDFVL RAAGGSSPAL ASWPVLGLLV 
MALASAAAIA VPWERAPVWA VGILPAVDIL ALGFSRMDPA GGATGTLAVV PALWLGRQYG
RRGAVVVLLS TAFLLAGPGL LYLGASGANL TRTVMIPLVA TWAALAISYA LERVRAGIEV
IEHQRRVSQA IFDTVDVGLV LLDESGAYQG INRRHADFIR LAFPEGHQGR AGQLGAVYDA
DGTRELAAEE MPTHRASLGE QFEDVRIWVG PDPLTRRALS VSARRVEDEA GALAGASLAY
KDVTDFMRAL RVKDEFVASV SHELRTPLTS IVGYTQMLLE RDDLPADVVA QLEVMARNSG
RLHRLVADLL HTARLDEGGA PLVRAPVDLA AVVRDCVEAA APAARRSGVE LVLELPPTLV
LRADQERLAQ VVDNLVSNAV KYTSAGGCVQ VTLLLDGDRA ELCVADTGIG IAAADRDRLF
TRFFRARQAE ERSIQGVGLG LSITKAIVES HGGRIEVESE LGRGSVFRVR LPLDA