Gene Noca_4520 details


Gene Information

Locus tag: Noca_4520
Symbol:
ID: 4597039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4779074
End bp: 4780489
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 74%
IMG OID: 639779131
Product: ATPase domain-containing protein
Protein accession: YP_925704
Protein GI: 119718739
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
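
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4779074, 4780489)
print(length, protein_length(length))
```

With the values listed for Noca_4520 this prints 1416 and 471, which agrees with the Gene Length and Protein Length fields.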


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.320419
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACCGTCT TCCCGCCGGT CCCTCCGGAC GGGCGGTGGC ACTACCGCCG CTCGCTCGCC 
AGTCGGGTCA CGCTGCTCAC CACCATCGCG GTCGGCGTCA CGGTCGCGTT CGTCGCCGCC
GGCGCCTACC TCACGGTCCG GATGCAGATG CAGACGACCC TCGACGACTC CCTCGTCGAG
CGGGCCACCA CCGCCTCGCT CGACCTGTGC CAGAACGACC TGCACATCCC CGACGAGTAC
CTCGGCGCCG CGGACGTCTG GGTCGCCTGC CTGCAGGGCG GCATCGGGGT CGCGACCTCG
GTCGACAAGA GCTCGCCGAT CAGCCTGCTG GGCTCGCCGG AGGCGGCCGT GGTCGGTGGG
GAGCGCCAGC AGTCGCTGCG CACCGTGGTC AGCAAGGACG GCGACACCCA CTGGCGGGTG
ATCGCGCTGC GCCGCGACGG CGGCCAAACG ATGATCCTGG CCCAGTCGCT GGAGTCGCAG
CAGAACGTGC TCAACCGGCT CGGCATCGTG ATGTTCCTCT TCGGCCTCGC CGGCGTGATC
GCCGCCGGCA TGGCCGGCTG GGCGGTGGCC CGCAACGGGC TCCGGCCGGT GCGGCGGCTC
ACCTCGTCGG TGGAGCAGAT CGCCCGCACC GAGGACCTGC GCCCGCTGCC GATCGAGGGC
GACGACGAGA TCGCCCGGCT CGCCACCGCG TTCAACCAGA TGCTGGCGGC GCTGGCGGCC
TCCCGCGACC GGCAGCGCCA GCTGGTCGCC GACGCCGGCC ACGAGCTGCG CACCCCGCTC
ACCTCGCTGC GCACCAACCT CGACCTGCTG GGCCAGGCCG ACCGGGGTGG CGTCGACCTC
CCGTCGGAGG CGCGCGCCGA GCTGCTCGAC GACGTGGCCG CCCAGATCGA GGAGATGACC
ACGCTGATCG GCGACCTGAT GGAGCTCGCC CGGGACGAGC CGCTCACCCA CGTGGTCGAG
CCGGTCGACG TACCCGAGCT GGTCGACCGG GCCATCGCGC GGGTCCGCCG CCGCGCGGCC
GGCGTGACCT TCGACGTGGT CGCCGAGCCC TGGTTCGTCG TCGGCGAGAG CGCCGGGCTC
GAGCGCGCCG TCACGAACCT GCTCGACAAC GCGGCGAAGT GGAGCCCGGC CGGCGGGACC
GTGCGGGTCC GGCTGGCGGG CGGCGTGCTC ACCGTCGACG ACGACGGGCC GGGGATCTCC
GAGGAGGACC TGCCGCACGT CTTCGACCGG TTCTACCGCT CGCAGGAGTC GCGGTCGATG
CCCGGCTCCG GGCTGGGGCT CTCCATCGTG CGCCAGGTCG CCGAACGGCA CTCCGGCACC
GTGTGGGCGG GGGCCGCGCC GACCGGCGGT GCCCGGCTCA CGCTGTGGCT GCCCGGCTCC
CGCGCGCCCG TCCCGGACTG GACGCCCGCG TCGTGA
 
Protein sequence
MTVFPPVPPD GRWHYRRSLA SRVTLLTTIA VGVTVAFVAA GAYLTVRMQM QTTLDDSLVE 
RATTASLDLC QNDLHIPDEY LGAADVWVAC LQGGIGVATS VDKSSPISLL GSPEAAVVGG
ERQQSLRTVV SKDGDTHWRV IALRRDGGQT MILAQSLESQ QNVLNRLGIV MFLFGLAGVI
AAGMAGWAVA RNGLRPVRRL TSSVEQIART EDLRPLPIEG DDEIARLATA FNQMLAALAA
SRDRQRQLVA DAGHELRTPL TSLRTNLDLL GQADRGGVDL PSEARAELLD DVAAQIEEMT
TLIGDLMELA RDEPLTHVVE PVDVPELVDR AIARVRRRAA GVTFDVVAEP WFVVGESAGL
ERAVTNLLDN AAKWSPAGGT VRVRLAGGVL TVDDDGPGIS EEDLPHVFDR FYRSQESRSM
PGSGLGLSIV RQVAERHSGT VWAGAAPTGG ARLTLWLPGS RAPVPDWTPA S
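
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal Python sketch, not the database's own pipeline. It assumes that translation table 11 uses the same amino acid assignments as the standard genetic code and that an alternative start codon such as GTG is read as methionine at the first position. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments, listed in TCAG codon order
# (assumed here to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if residues:
        # Assumption: the initiator codon (here GTG) is translated as methionine.
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence, copied with the original spacing.
print(translate_cds("GTGACCGTCT TCCCGCCGGT CCCTCCGGAC"))
```

For the first 30 bases this prints MTVFPPVPPD, the first ten residues of the protein sequence listed above.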