Gene GM21_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1404 
Symbol 
ID8136732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1650922 
End bp1652007 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content63% 
IMG OID644869018 
Productsignal transduction histidine kinase, nitrogen specific, NtrB 
Protein accessionYP_003021221 
Protein GI253700032 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.0000236023 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTGAGA AACTGGAGAG TTACTACGCC AACGTGATCG ACAGCGTGGG TGACGGCGTG 
ATCGTCCTGG ACAACGCCGG GGCCGTGACG CTGGTCAATC CCGCCGCCGA GGAACTGGCC
GGCGTTTCGC GCCGCCAGGC GATGGGCGTT CTCTTCAGCG AGATCTTCAA AGGGGAAGGA
CCTCTCAACG AGATGGTCGC CAAGACCGTG GAGACAGGGA TGTCGGTATC CGACCACGAG
AACATCGTGG TCAAGCGGGG GGGGAAGCTG ATCCCCGTCG GCGCCAGCAC CTCGCCGCTT
TTAAGCGCGA GCGGGGAGCG CATCGGCACC ATTTTACTTC TGCGGGACCT GACCAACGTG
CGCGAGCTGG AGTCGGCCGT GCGCCAGGCG GACCGCCTTT CGGCGCTGGG CGGGCTGGCA
GCCGGGCTCG CGCACGAGAT CAAGAACCCA CTGGGCGGGA TCAAGGGGGC GGCGCAGCTA
CTGGAACTGG AATTTCCCGA CAACGAGGAC CTGCGCGAGT ACATCAGGGT GATGCTGAAG
GAGGTACAGC GGGTCAACCT CATCGTGGAG GAACTCCTGG CGCTCGCTTC GCCGGGGCGT
CTGAAGCTCT CCAAGGTGAA CCTGCACCGG GTCCTTTCCG ACATCGTCTT GTTGCAGAAA
AACGCCAGCG AGGGGAAAGA GGTCTACCTC CAGCAGTACT TCGATCCTAG CATCCCTCCC
ATCCTGGGGG ACGAGGCGCT TTTAACCCAG CTCTTCCTGA ACCTGATCAA GAACGCGCTG
GAGGCGGTGG AGGCAGGCGG CGTGGTGAAG GTGACCAGCC GGGTGCTGTC GGACTACAGC
ATGACCCAGC GGGGGGAGCG GCGGGCGCGC ATGGTGGCCA TCGACATAGC CGACAACGGT
CCGGGTATCG AGGCCGAGGT GCTGGAGAAC ATGTTCACCC CGTTTTTCAC CACCAAGTCC
CAGGGGACAG GGTTGGGTCT TGCCATCTGC CAGAAGATCG TCTCTGAGCA TCGGGGCATG
ATCAAGGTGG ATTCCGACGC CAAGCGCGGC ACCGTCTTCA CCGTCATGCT GCCGCTGGTG
CAATAA
 
Protein sequence
MSEKLESYYA NVIDSVGDGV IVLDNAGAVT LVNPAAEELA GVSRRQAMGV LFSEIFKGEG 
PLNEMVAKTV ETGMSVSDHE NIVVKRGGKL IPVGASTSPL LSASGERIGT ILLLRDLTNV
RELESAVRQA DRLSALGGLA AGLAHEIKNP LGGIKGAAQL LELEFPDNED LREYIRVMLK
EVQRVNLIVE ELLALASPGR LKLSKVNLHR VLSDIVLLQK NASEGKEVYL QQYFDPSIPP
ILGDEALLTQ LFLNLIKNAL EAVEAGGVVK VTSRVLSDYS MTQRGERRAR MVAIDIADNG
PGIEAEVLEN MFTPFFTTKS QGTGLGLAIC QKIVSEHRGM IKVDSDAKRG TVFTVMLPLV
Q