Gene Nham_3802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_3802 
Symbol 
ID4033346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp4179518 
End bp4181242 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content56% 
IMG OID637972200 
Productsignal transduction histidine kinase 
Protein accessionYP_578974 
Protein GI92119245 
COG category[T] Signal transduction mechanisms 
COG ID[COG3920] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.560077 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGTCTG GCGCTGGGCG CGCTAACTCT TTCCCTGATG AGCCAGACCC TCCCTTCTCT 
CTGACGACGT CACCTTATCG AACGAAGAAA TGGTCCTCGC TTAAGGCCTG GGGAATCAAA
CTCGCAAATC AGATCGGCAG CATCGTAAAG CCATATCAAG CCATGATGAT CAACTGGTTC
AAAGCTCCGA CCAACCCGCG CCGCGCTTGG CTTTATTTTC TGATGTTCGG TGGGGCTTAC
TCGGTACTCT GCATTTTCGG TGCCCTCACG GCCAACGATA AGATCTTGTC GCTGATCTGG
CCCGCCAACC CGTTTATGCT TGGTATGCTC GTGCGCTTTC CTGTATTGGC GCTTCCCCTG
GGCTGGGTCG CTTGTCTTGC CGGATTCGCG ATTGCGGTCC CGATTATCGG TTGCGGGCTT
ATGACAGGCG CGGGTCTGGC CGCATATAAT TTCGGTGTCG TGGTCATTGG CTATGTCTTG
CTATCCAGGC TCGATCGGAT TGACCAGCGT CTCCAGCGCC TGACTTCGGT TTTCTACCTG
CTCTTTGCGG TGGGCGCCGC GTCCATATTC GCCGGCATCG TTGGCTCGCT CCTGATCGGT
CCTTTATTCC ACGATCCGGT AGCCGTAAGC TCATTTCGGT ACTGGTTTTC CGTCGAGCTT
TTGAACCAGT TGGCTTTTTT GCCGATAATC CTATCCTACC CGGAAGGTCG CAAATGGGGG
CGGCAGCAAC TGCCGCAGCC GACCTTTCAC GACCAAGCGC CGATTATTGT GTTGGTGCTC
TCGGCCGTCA CGGGAATCTT CTTCGGAGGA GTGGGGGCTC TCGCTTTCCC TGTACCGGCC
CTTCTCTGGT GCGCAATTTC CTATCGCGTC TTTCTGACGG CCTTACTGAC CTTCGTATTC
TGCATATGGG GGATCATGGC GACAACGTTG GGCTACGTTG ACGCATCGCA TGTCGATCAA
TCGCTCGTGC TGTCGATCAG CATGGGGGCT GCACTGATCT CTCTGGGCCC GCTCATCATC
TCCACGACCA CTGCGACCAG AAATGAAATC CTTGACCAGC TAAGGCACCT CGCGGCCGAG
CGTGAGCTTG TGGCCAATGA ACTCGATCAT CGGATCAAGA ACCTGTTCGC GCTCGTCAAC
GGGCTGATCA GCCTATCGGT CCGTGGCAAA CCCGAGATGA AACCGCTGGC AGACACACTG
AGAAGCCGGC TCGTGGCATT ACACCATGCG CATGGTCTTG TTCGCATTCG CACCGGGAGC
GCGTCGTCGG GCCCTCCCGG GGGATTCGCT TCACTGAAGG AGCTCATCGG CACCCTGCTT
CGGCCTTATG AGGGTGCTGA AGACAAACAC GTCGTCGTCG ATGGCGATGA CGTGTTCATC
GATGGTGGGA TCGTTACGTC GCTGGCTCTC GTTTTCCACG AACTGGCGAC AAATTCGACC
AAGCATGGCG CGTTGAGTGA TCTGGACGGC GCCTTGGGCG TCCGTATCAG CCGCGACATT
GACGACCTGC ATGTCATGTG GACTGAGAGG GCCCCCGTGA CGACAGACTA TTCTGATGCT
GCGGACAGTG GCTTTGGTTC GAAGCTTCTG GATCTTACAA TCAACGAACA ATTGCAAGGA
AGCTATGTCC GCACCTGGAC AGTAGGCGGC ATGGACATCG AGATCATTCT GCCGCGCAAA
CTGTTTAGCG ACATCCCATC CAATCCCTCC ATCTTGTCGT CGTAG
 
Protein sequence
MGSGAGRANS FPDEPDPPFS LTTSPYRTKK WSSLKAWGIK LANQIGSIVK PYQAMMINWF 
KAPTNPRRAW LYFLMFGGAY SVLCIFGALT ANDKILSLIW PANPFMLGML VRFPVLALPL
GWVACLAGFA IAVPIIGCGL MTGAGLAAYN FGVVVIGYVL LSRLDRIDQR LQRLTSVFYL
LFAVGAASIF AGIVGSLLIG PLFHDPVAVS SFRYWFSVEL LNQLAFLPII LSYPEGRKWG
RQQLPQPTFH DQAPIIVLVL SAVTGIFFGG VGALAFPVPA LLWCAISYRV FLTALLTFVF
CIWGIMATTL GYVDASHVDQ SLVLSISMGA ALISLGPLII STTTATRNEI LDQLRHLAAE
RELVANELDH RIKNLFALVN GLISLSVRGK PEMKPLADTL RSRLVALHHA HGLVRIRTGS
ASSGPPGGFA SLKELIGTLL RPYEGAEDKH VVVDGDDVFI DGGIVTSLAL VFHELATNST
KHGALSDLDG ALGVRISRDI DDLHVMWTER APVTTDYSDA ADSGFGSKLL DLTINEQLQG
SYVRTWTVGG MDIEIILPRK LFSDIPSNPS ILSS