Gene Mmar10_1441 details


Gene Information

Locus tag: Mmar10_1441
Symbol:
ID: 4285680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Maricaulis maris MCS10
Kingdom: Bacteria
Replicon accession: NC_008347
Strand:
Start bp: 1579104
End bp: 1580153
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 64%
IMG OID: 638140923
Product: signal transduction histidine kinase, nitrogen specific, NtrB
Protein accession: YP_756671
Protein GI: 114569991
COG category: [T] Signal transduction mechanisms
COG ID: [COG3852] Signal transduction histidine kinase, nitrogen specific
TIGRFAM ID:
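
The coordinates and lengths listed above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1579104
end_bp = 1580153
reported_gene_length = 1050     # bp
reported_protein_length = 349   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene = gene_length(start_bp, end_bp)
computed_protein = protein_length(computed_gene)
print(computed_gene, computed_protein)  # 1050 349
```

Under those assumptions, both computed values agree with the reported gene and protein lengths.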


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0722715
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.0000363452
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACGGCAG CGATCGAACC CAGCGCTGCC GCCCTGGCCG CCGATCAGGC TGCTATCCCC 
ATTCTCATTT TCAGCGACGA GGCGCGCTGT GTCTGGGCCA ATACCCAGGC CGAGGAATGG
CTCGGCCTGT CGATCCGGAA CATCCGAAAA AGCCGGTTCG GGGAATTGTC GGCTGTCTGC
GCGCACCTCG CCGACATTGT CGATCAAGCC TGTGATGCGC GCCGGACCCT GGTGGCGCTG
GGCCGACCAC TGGGCGGGGC CGGGCTCTAT GATGTCCATG CCCGCTGGTC TGACCAGCAT
GAACAGCTCG CCCTTTCGGT CCTGCCCCAC CAATCCATGG GCGCCAAGGC GTCTGAAGCG
CCAGCGCTGG GTTTCGGCCG CATGCTCGCG CACGAATTGA AGAACCCGCT GGCAAGCGTT
CGCGGGGCGG CGCAACTGAT CCGGCGCGAG ACCGAGCTGG AAGGCGCACG TGATCTGGCC
CGCTTGATCA TCCAGGATGT CGACCGCATC ACCCGGCTGG CTGACCATTG GAGCCGGGTC
GGCGATATCC GCTTGGGCGA GCAGTCGGAG ATCAATCTCA ACCTGCTGGC AGTGAGTGCG
ATGGAGAGCC TCAACCGGGC TGATCCGGCC ACGATCGGTG TCCTGCGCGA GAATTTTGAC
CCATCCCTGC CGTCAATCAA TGGCGATCCG GACCTGCTGA TGCAGGCCGT CCTGAATCTG
ATCCAGAATG CCTTCGATGC CGTGCGGTCC GATCCCGGCG GGACGATTAC CGTGGAGACC
CGCTACGATG CCGGTCCACG TAGCCGGTCA AGCGGGCATC CCACGCCGCT GGTCCTCTCG
GTTCGCGACA ATGGTCCCGG CATTCCGGAA TCGCTCGGAC CGGGTATCTT TACACCGTTT
GTGACGACCA AACCCGCCGG CGAAGGTCTG GGGCTGGCAT TCGCGGCCCG GATTGCCGCG
CTGCACGACG GGCAGATCGA CTTTGAAAGC CGCCCGGGAA CCACCGTGTT CAATATCCGC
CTGCCGATTG CCAAGAAGGA TTTGCCGTGA
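
The GC content field in the record can, in principle, be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases over the full sequence length, and that the spaces and line breaks in the listing are only layout. `gc_content` is a hypothetical helper, and the fragment used here is just the first line of the gene sequence, not the whole gene.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, kept in its 10-base blocks.
fragment = "ATGACGGCAG CGATCGAACC CAGCGCTGCC GCCCTGGCCG CCGATCAGGC TGCTATCCCC"
print(f"{gc_content(fragment):.1f}% GC over {len(''.join(fragment.split()))} bases")
```

To compare against the reported figure, pass the complete gene sequence rather than this fragment.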
 
Protein sequence
MTAAIEPSAA ALAADQAAIP ILIFSDEARC VWANTQAEEW LGLSIRNIRK SRFGELSAVC 
AHLADIVDQA CDARRTLVAL GRPLGGAGLY DVHARWSDQH EQLALSVLPH QSMGAKASEA
PALGFGRMLA HELKNPLASV RGAAQLIRRE TELEGARDLA RLIIQDVDRI TRLADHWSRV
GDIRLGEQSE INLNLLAVSA MESLNRADPA TIGVLRENFD PSLPSINGDP DLLMQAVLNL
IQNAFDAVRS DPGGTITVET RYDAGPRSRS SGHPTPLVLS VRDNGPGIPE SLGPGIFTPF
VTTKPAGEGL GLAFAARIAA LHDGQIDFES RPGTTVFNIR LPIAKKDLP
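
The relationship between the two sequences can be illustrated by translating the start of the gene sequence. This is a minimal sketch using only the standard library. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code for elongation codons, and it ignores alternative start codons, which is harmless here because the gene begins with ATG. `translate` is a hypothetical helper, not a function from any sequence library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_60 = "ATGACGGCAG CGATCGAACC CAGCGCTGCC GCCCTGGCCG CCGATCAGGC TGCTATCCCC"
print(translate(first_60))  # MTAAIEPSAAALAADQAAIP
```

The output matches the first 20 residues of the protein sequence. The same function can be applied to the full gene sequence to compare against the complete protein.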