Gene Noca_3604 details

Gene Information

Locus tag: Noca_3604
Symbol:
ID: 4599407
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3820849
End bp: 3823077
Gene Length: 2229 bp
Protein Length: 742 aa
Translation table: 11
GC content: 71%
IMG OID: 639778212
Product: ATPase domain-containing protein
Protein accession: YP_924791
Protein GI: 119717826
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0643] Chemotaxis protein histidine kinase and related kinases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.864883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAAGAGA TCGACGAGAT CGTCCAGGAG TTCCTCGTGG AGAGCCACGA GAACCTCGAC 
CAGCTCGATC GCGACCTGGT CGCGCTCGAG AGGGAGCCCG GGTCCCGTGA CCTGCTGGGC
AGCATCTTCA GGACCATCCA CACCATCAAG GGGACCAGCG GGTTCCTGGC GTTCGGCAAC
CTCGAGTCGG TGACCCACGT CGGGGAGAAC CTCCTCGCCC GTCTGCGCGA CGGCAAGCAG
TCGATGACCC CGCAGACCAC CGACGTGCTG CTCGCGATGG TCGACAAGGT GCGCGAGCTC
CTGGCAGTGA TCGAGGACTG CGGCACCGAG GGCGACATCG ACGTCACCGA GGTCGTGGAG
CGGATCACCG CGGTGCTCGA GGGCAACGCC GCCCCGACCC CCATCGCCGA GCTGCTGCCC
GAGCCCGAGG CCGCGGTGGA GGTCGAGGCC GAGGTCGCCG CCGAGCCGGT GCCCACCCCC
ACGCCCACTG CCACGGCCGT CGAGGCGACC GAGGAGGTCT CGTCGACCCG CCGGAACGTC
GCCGAGTCCT CGATCCGCGT CGACGTCGAG CTGCTCGACA CCCTGATGCG CCTGGTCGGT
GAGCTGGTGC TCACCCGCAA CCAGATCGTG CGGCAGGCCG GCGACCAGGA CGACCTCGAC
CTGACCCGGT CGGCCCAGCG CCTCAACCTG ATCGCGACCG AGCTGCAGGA GGGAGTCATG
AAGACCCGCA TGCAGCCGAT CGATCATCTC TGGTCGAAGC TGCCGCGCGT CGTGCGCGAC
CTCGGCGCGG CCTGCGGCAA GAGCGTCACC CTGGCCATGG TGGGCCGGGA GACCGAGCTC
GACCGCTCGC TCCTGGAGTC CGTCAAGGAC CCGCTGACCC ACCTGGTCCG CAACGCGGTC
GACCACGGCC TCGAGGACCC CGCGGGCCGG ATCGCAGCCG GGAAGCCCGC TGAGGGCGTG
CTGACCCTGC GCGCCTACCA CGAGGGCGGC CAGGTGGTCG TCGAGGTCTG CGACGACGGA
GCGGGCATCG ACGCCGACCG GATCGCCGAG AAGGCGCTGG CCAGCGGCCT GCGCACCACC
GCCCAGCTCG CTCAGATGTC GCCCGCCGAC ATCCTCCAGC TGATCTTCCT CCCCGGGTTC
TCGACCGCGC AGTCGGTCAC GAACGTCTCC GGGCGCGGCG TCGGCATGGA CGTCGTGAAG
ACCAACATCG AGGCCATCGG CGGGACCATC GAGGTCGAGT CCGTCGTCGG CCGGGGCACC
ACCTGCCGGC TGCGGATCCC CCTGACGCTG GCGATCGTGC CGGCGCTCAC CGTGGAGTGT
GCGGGCGACC GGTATGCGAT CCCACAGGTC AGCCTGCTGG AGCTGGTCGC GCTGGACGCC
GACCGGGCCG CCACTGCGGT CGAGGACGTC AACGGCGCGT CCGTCTACCG CCTGCGCGGC
GCCCTGCTCC CGCTGGTCCG CCTCACCGAC GTCCTCGGTG TCGAGTCCGA CCGGTCCGAC
GGGCACGTGC TCATCGCGGT GCTCCAGGCC GAGGGCAAGC GCTTCGGCCT GGTCATCGAC
CGGGTGCTCA GCACCGAGGA GATCGTGGTC AAGCCGCTGA CCTCCCGGCT GAAGTCGCTG
GGGACCTACT CCGGCGCGAC CATCCTCGGT GACGGCCGGG TGGCCCTGAT CCTCGACGTC
CAGGCGCTCG CCCGACGGGC GCTCACCGCC GAGGCCCTCG AGCGCGGCGC CGCCGGCACC
GCCGCCGAGG AGACCAAGAC GTCCCGGGAG CACCTGCGGA TGCTGGTCGC CGGCATCGGT
GGCGGCCGCC GCGTCGCGAT CCCGCTCTCC TCGGTCACCC GCCTCGAGAA CATCGCGGCC
AGCACGGTCG AGGTGGTGGG CAGCCGCGAG GTCGTCCAGT ACCGCGGCGC GATCCTGCCG
CTGCTGCGGC TGGACCGGCA CCTCGGCGCG ATCAGCGAGC GCGCCGGCGA CGACCTGGTG
GTCGTCGTCT ACTCGGCCGG CGCACGCAGC GTCGCGATCG TGGTCGACGA GATCATCGAC
ATCGTCGACG AGGAGTCCGA GGTCCACAGC GACATCGACG ACCACGGCCT GGTCGGCTCC
ACGCTGATCC GCGACCGGAT CGTCGAGGTC CTCGACGTAC GCGCGGCGAT CCTCGCCGCC
GACCCGAAGT TCTACTCCGA CTCGGACGTC GAGATCGATC TCCGCGACGA GCTCCAGGAG
GCGGTGTGA
 
Protein sequence
MEEIDEIVQE FLVESHENLD QLDRDLVALE REPGSRDLLG SIFRTIHTIK GTSGFLAFGN 
LESVTHVGEN LLARLRDGKQ SMTPQTTDVL LAMVDKVREL LAVIEDCGTE GDIDVTEVVE
RITAVLEGNA APTPIAELLP EPEAAVEVEA EVAAEPVPTP TPTATAVEAT EEVSSTRRNV
AESSIRVDVE LLDTLMRLVG ELVLTRNQIV RQAGDQDDLD LTRSAQRLNL IATELQEGVM
KTRMQPIDHL WSKLPRVVRD LGAACGKSVT LAMVGRETEL DRSLLESVKD PLTHLVRNAV
DHGLEDPAGR IAAGKPAEGV LTLRAYHEGG QVVVEVCDDG AGIDADRIAE KALASGLRTT
AQLAQMSPAD ILQLIFLPGF STAQSVTNVS GRGVGMDVVK TNIEAIGGTI EVESVVGRGT
TCRLRIPLTL AIVPALTVEC AGDRYAIPQV SLLELVALDA DRAATAVEDV NGASVYRLRG
ALLPLVRLTD VLGVESDRSD GHVLIAVLQA EGKRFGLVID RVLSTEEIVV KPLTSRLKSL
GTYSGATILG DGRVALILDV QALARRALTA EALERGAAGT AAEETKTSRE HLRMLVAGIG
GGRRVAIPLS SVTRLENIAA STVEVVGSRE VVQYRGAILP LLRLDRHLGA ISERAGDDLV
VVVYSAGARS VAIVVDEIID IVDEESEVHS DIDDHGLVGS TLIRDRIVEV LDVRAAILAA
DPKFYSDSDV EIDLRDELQE AV