Gene MCA2847 details

Gene Information

Locus tag: MCA2847
Symbol:
ID: 3104522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 3038541
End bp: 3040745
Gene Length: 2205 bp
Protein Length: 734 aa
Translation table: 11
GC content: 61%
IMG OID: 637171976
Product: nitrogen regulation protein NtrY, putative
Protein accession: YP_115241
Protein GI: 53803076
COG category: [T] Signal transduction mechanisms
COG ID: [COG5000] Signal transduction histidine kinase involved in nitrogen fixation and metabolism regulation
TIGRFAM ID: [TIGR00229] PAS domain S-box
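
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3038541
end_bp = 3040745
reported_gene_length = 2205      # bp
reported_protein_length = 734    # aa

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length % 3, protein_length)
```

Under these assumptions the computed span is a whole number of codons, and both derived lengths agree with the reported Gene Length and Protein Length fields.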


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCATTC GCAAACTGAG AATCCGCCTG CCGCTCAGCC TGGTCGTGAT CACGTTGTTC 
ACGGCCATCG TCGCTTCGCT GCACCTGATG AGTTCTGCCA CGCAGAGCTC GCCGCAGCTC
GGCAGCATGT ACTCGCTGCT GGTCCTCATC AATTCGGTCG GTTCGATCCT GTTACTGACG
CTGGTCGTCG TCAATGTGTA CGGACTCCTC CGCCAGCTCG CCAGACGCGC GGCCGGCTCC
CATCTGACGG CCCGCATGGC ATTTCTGTTC GTCCTGCTCA GCCTGGCACC GGCATCCATC
GTGTTCTATT ACTCCATGCA GTTTCTCAAA CAAAGCATCG ACAGTTGGTT CGACGTCCGC
ATCGATCAGG CCATGGAGGA CGCGCTGGAG CTGGGGCGGG CGGCAATCGA CGAGCGGATG
CGCACCATGC TGCACCAAAC CGAACAGGCG GCCGCCAAAT TGCAGCTCGC CCCGATCATG
GAATATCCCT TGCGCCTGAC CGAACTTCGG GACGAGCTGG GCGAAGGTGA GTTCACGGTG
TTTTCCCGGC AAGGGCGGAT CATCGCCGCC GGCGGCTCTC AGCTGGGCTT CATCCTCCCC
GACCTGCCGG ATACCGGGGT TCTGCTCCGG GTGAAACACG GCAAATCCTT TATCCGGCTG
GAACCCAACC CGGACGGCAA TCTGCAGATC CGCGCCGTCG TGGCCATCCC CTCCGACGAC
CCTCTGTTTC TGCAGGCCGT GTTCCCGGTA CCACTGCGCA TCTCCCAGCT GGCCAGCACG
GTCGAAGCGG CCTATGTTCA CTACAAGGAA CTCGGGTTCC TCCGCAGTTC CCTCAAGCAC
ACATTCATCC TCACGCTGTC GTTGGTGCTG CTGCTCAGCC TCCTGGCGGC CATCTGGGTC
GCATTCGTGA GCATCCGCCG CATCGTCGCA CCGGTGCGAC GGCTGGCCCA GGGGACGCGC
GCAGTCGCCG AAGGCCGCTA CGGCCAGCGC CTCGTAGTGC GGGCGAAAGA CGAACTCGGT
TTCCTGGTCG AGTCGTTCAA CACGATGACC GAAAAACTCG CCCAGGCCAG TGAGGAGGCC
CGAATGTCGC GGCTCGAAGT CGAGCGCCAG CGCGCCTATC TCGAAACCGT GCTGTCCAAC
CTCTCCTCGG GCGTACTCAG CTTCGACAAA GAATCCCGGC TTCTCACCGC GAACCACGCA
GTGGACGAGA TCCTGCACGT TCCCGCCCAC GACTACCTGG GTCTCCCGCT CTCCCGGCTC
AAAACCGACC ATCCCTATCT CACCGAAACG CTGGAACGTA TCGAGCGCTG GCTGGAAAAA
GGCACCGGTA ACTGGGAGGG GGAAATGTCC TTCGTCGGTC CCTCGGGTCG GCGAGAGCTC
TACTGCCATG GCACCCCCCT GTTCAATGGC AACGGCGATA AACTTGGCGC CGTGGTGGTA
TTCGAAGACG TGACGGCATT GATCCTCGCC CAGCGCCAGG CGGCTTGGAG CGAGGTCGCC
CGACGGCTGG CCCACGAAAT CAAAAATCCG CTCACCCCAA TCCAGCTCTC TGCCGAACGT
CTGCATCACA AGCTGGCCAG CCGGCTCGAC GAGCGCGACG CGGAAATCCT GGACCGTTCG
ACCCGCACCA TCGTGCATCA GGTCGAGGCA CTGAAAGCCA TGGTCAATGC CTTCGCCGAA
TACGCGAGAT CCTCGACCAT CCAGCTCCGG CGAGTCGCGC TGGCCGAAAT CGTGGAAGAA
GTGGTGGCGT TGTACCCGCC ACAATCCGGC ATGAGTTTCG AAATCGATGA AGAACCGAGC
CTGCCCAGGA TTTCGGGCGA TCCGCTGCAG CTGCGTCAAG TCCTGCACAA TCTCATCAAG
AATTCGCAGG AAGCCCTGAC GCCGTCCACA CAAGGTAAGA TGTGCTTTAT TCTTCGCAAG
ACGGTCGATC AGGGCGAGGC CTTCGTGCAA CTGACCGTCC GAGACAATGG CCCCGGCATT
CCGAAGGAGC AGGCGGATCG GATCTTCGAA CCCTATGTCA CCACCAAAAC CAAGGGCACC
GGGCTGGGAC TTGCGATCGT CAAAAAGATC ATCGAAGAAC ACGGCGGGAC CATCCGCGTG
GAAAACGGAC TCCGGCAGGG CGCCGGTTTC ATCATCCGGT TCCCCGTCCC AGAAACAGAC
CTTGGAGCCG GCGCAAATGG CGGGAACTCT GGAGGGAAAT CATGA
 
Protein sequence
MVIRKLRIRL PLSLVVITLF TAIVASLHLM SSATQSSPQL GSMYSLLVLI NSVGSILLLT 
LVVVNVYGLL RQLARRAAGS HLTARMAFLF VLLSLAPASI VFYYSMQFLK QSIDSWFDVR
IDQAMEDALE LGRAAIDERM RTMLHQTEQA AAKLQLAPIM EYPLRLTELR DELGEGEFTV
FSRQGRIIAA GGSQLGFILP DLPDTGVLLR VKHGKSFIRL EPNPDGNLQI RAVVAIPSDD
PLFLQAVFPV PLRISQLAST VEAAYVHYKE LGFLRSSLKH TFILTLSLVL LLSLLAAIWV
AFVSIRRIVA PVRRLAQGTR AVAEGRYGQR LVVRAKDELG FLVESFNTMT EKLAQASEEA
RMSRLEVERQ RAYLETVLSN LSSGVLSFDK ESRLLTANHA VDEILHVPAH DYLGLPLSRL
KTDHPYLTET LERIERWLEK GTGNWEGEMS FVGPSGRREL YCHGTPLFNG NGDKLGAVVV
FEDVTALILA QRQAAWSEVA RRLAHEIKNP LTPIQLSAER LHHKLASRLD ERDAEILDRS
TRTIVHQVEA LKAMVNAFAE YARSSTIQLR RVALAEIVEE VVALYPPQSG MSFEIDEEPS
LPRISGDPLQ LRQVLHNLIK NSQEALTPST QGKMCFILRK TVDQGEAFVQ LTVRDNGPGI
PKEQADRIFE PYVTTKTKGT GLGLAIVKKI IEEHGGTIRV ENGLRQGAGF IIRFPVPETD
LGAGANGGNS GGKS
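
The relationship between the gene sequence and the protein sequence can be illustrated by translating short stretches of the DNA. The following is a minimal sketch, not part of the original record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper name `translate` is hypothetical, and only the first 30 and last 45 bases of the gene sequence are used.

```python
# Codon table built in the conventional T, C, A, G ordering.
# Assumption: amino-acid assignments for translation table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base grouping used in the
    record can be pasted in unchanged.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 45 bases of the gene sequence above.
first_30 = "ATGGTCATTC GCAAACTGAG AATCCGCCTG"
last_45 = "CTTGGAGCCG GCGCAAATGG CGGGAACTCT GGAGGGAAAT CATGA"

n_terminus = translate(first_30)
c_terminus = translate(last_45)
print(n_terminus)  # MVIRKLRIRL
print(c_terminus)  # LGAGANGGNSGGKS
```

The two fragments correspond to the first ten and last fourteen residues of the protein sequence listed above, and the final codon TGA is the stop codon.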