Gene MCA1533 details


Gene Information

Locus tag: MCA1533
Symbol:
ID: 3102613
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1632843
End bp: 1635740
Gene Length: 2898 bp
Protein Length: 965 aa
Translation table: 11
GC content: 63%
IMG OID: 637170706
Product: sensory box protein
Protein accession: YP_113988
Protein GI: 53804176
COG category: [T] Signal transduction mechanisms
COG ID: [COG5001] Predicted signal transduction protein containing a membrane domain, an EAL and a GGDEF domain
TIGRFAM ID: [TIGR00229] PAS domain S-box; [TIGR00254] diguanylate cyclase (GGDEF) domain
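
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 2898 bp) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1632843, 1635740)
print(length)                     # 2898
print(protein_length_aa(length))  # 965
```

Both values agree with the Gene Length and Protein Length fields of this record.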


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.76081
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACTACC TGATCCTTGC TTTGGGTGTT GCTTTCGTTC CGATCGCGCT GGCCTACGGG 
TTGCTGCTGA AGCCCAAGCG GGCCGAAGCG GCGCTGCGAC GGAGCGCGTT CCATCTGCGC
GAAGCGCAGC AGGTGGCCAA TCTCGGCAGT TGGGAACTCG ATCTGGTGCA AGGCCGGCTG
GAATGGTCGG ACCAGGTGTT CCGGATTTTC GAGATCGATC CCGAACGGTT CGAGGCGTCC
TACGAAGCCT TCCTCGCCCT CGTGCATCCC GACGACAGGG AGAGGGTAGA CGCGGCCTAC
CGCCGCTCGA TCGCCGACAG AGGGTCCTAT GAAATCGTCC ATCGGCTGCT CATGCCAGAC
GGCCGCGTCA AACACGTCCG CGAGCGTTGC CAAACCTTTT ACAGCGAAGA TGGGCGGCCC
TTGCGATCAG TGGGGACGGT ATGGGACGTC ACATCCTTTC ATGAAACCGA AGAGAAACTT
CACCGACTCG CTGCCATCGT GGAATCGACC GGTGACACCG TCATCGTCAG CGATCTCCAA
GGCAGAATCC GTACCTGGAA CAAAGGCGCG GAACGGATGT GGGGCTATGC GGCAGAGGAA
ATCATAGGGC AGCCGGTCAC CGTTCTGCCA CCGCTCGCAA AGCAGGGCGA ACCGGGGCAG
ATATTGCGGC GAGTCATCGA TGAGCACGAG GTCGTCCGTT TCGAGAGCCG GGGTCTGCAC
AAGGACGGCC GCGATTTCCC GATATCGCTG ACGGTTTCGC CGCTTTTCGA CGCCGAAAAC
CGCCTGACCG GCGTATCCGC GATCATTCGT GACATCACGG AAATGGAACG GCTGAAAGCG
ACGTTGCAGG AGCGGCTCAA ATTGGTGGAA ATGGTCTTCC AATACAGTGT CTCGTGCCTC
GTCGTTCTGG ATCCCCAGTT CAATTTCATC CGTGTCAATC CGGCCTATGC GCGGGCCTGC
CGCCGGGATG TCGCCGAGTT CGAAGGCCGC AACCATTTCG AGCTATACCC TTCGGATGCC
CGGTTCATCT TCGAGGAGGT CGTGCGCACC CGCCGGCCGT TCGAAGCGGT TGCGCGCCCT
TTCACGTTTC CGGACCAGCC GGAACGGGGC GTCACCTACT GGGACTGGAA CCTGGTGCCG
GTTCTCGACG GCAAAGGCGA GGTTGAATAT CTGCTGTTCT CTCTGGTCGA CGTGACCGAC
AGGCAGAAGG CCACGGCGCA GCTGCGCCTG ATCGAAACCG CCTTCCAGCA TACCCGCGAT
GCGGTGCTGA TCACCGATGC CCGGGCCAGG ATCCTGCGGG TCAATCCTGC TTTCGAAACC
ATCACTGGCT ACAGTGCCGG GGAAGTGGTC GGACGCACGC CCAGAATGCT CCAGTCCGGG
CGTCATGACG GCGCTTTCTA CCGGCGGTTC TGGCGGGCGC TCGAGACCGA AGGGCATTGG
AGCGGCGAGA TCTGGAACCG CCACCGGGAC GGTCACATTT TCCCCGCATG GCAGTCGGTG
TCGGCGGTCA AAGGCCCCGA TGGCAGCACG ACCCACTATG TCAGCATCTT CACCGACATC
AGCGAGTTCA AGCGGGCCGA AGCTCAGATC CGCTATCTCT CCTACCACGA CGATCTCACG
GGCCTGCCGA ACCGGGCCTA TTTCCAGGCG CGGCTGGACC AGCTCATCGA AGCTGCGGCA
CGGGACCGGA AGCAGATCGC CCTGGTCGTG CTCGATCTGG ACCGTTTCAA GACCATCAAC
GATTCGCTGG GGCACAGCGT GGGCGACGAG CTTCTGTGCC AGGTAGCTGA GCGAATCCAG
GACTGTACCG GACAGGGTGA CTTCGTGGCT CGCCAGGGGG GTGACGAATT CGTCGTCATC
CTGGCAGACT CCGATGCCGT TCGGGCGGCG CGGGTCGCGA ACGCGATGAT CTCGGCCATA
GCGAAACCTG CCGTCGCCGG TGGCCGGACA CTGATCGTGA CGCCGAGCTT AGGGATCAGT
CTCTATCCCC ACGACGCCGG CGACTCCGAG AGCCTGATCA AGAGCGCCGA CGCGGCGATG
TACCACGCGA AGTCTCTGGG CCGCAACACC TACCAGTTTT TCTCGGCGGA AATGAGCGCC
GTGGCCACCG AGCGGCTGGC CCTGGAAAAT GCCTTGCACC AGGCCCTGTC GAACGGGGAG
TTCCGGCTTT ATTATCAGCC ACAGATCGAG GTCTGCTCCC GCCGGCTGAT CGGGCTGGAA
GCGTTGATCC GCTGGCGGCA TCCGGAGGTG GGCTGGGTGC CGCCGCTACG CTTCATCCCG
TTGGCCGAAG AGACCGGCAT CATCCATCGG ATCGGTCGAT GGGTCCTGGA GGAGGCCTGC
CGCCAGCAGC GCGAATGGCA GGCCGCAGGT CTGTGCATCG TTCCCGTCGC GGTCAACCTG
TCGCCGCTGC AATTGCAGAA GGACGATTTC CCGGAACAGG TGGAAGCGTT GCTGTCCGGC
TGCAGACTGG CGCCGGGACT CCTCGAACTG GAGCTGACCG AGACGGCGGT CATGCGCGAC
GTCGGACGGA TGTCCGACAT GCTGCGGAAA CTGAAGGAGC GCGGCACACG CTTTTCCATC
GACGATTTCG GCACCGGCTA CAGTTCGCTC GGATACCTCA ACCGTTTTCC GGTCGACAAG
CTGAAGATCG ACCAGTCGTT CATTCGCGAC GTGACCCGCA GGGAAGACCA TGCCGCGATC
ACTCGGGCCA TCATTGCCCT GGCCAAGCAG CTGCACTTGA AGGTGGTCGC CGAGGGCGTG
GAGACTGCGG AGCAACTCGA ATTCCTCGAA CGGGCCGGGT GCGACGGGGC ACAGGGGTTC
TACTTCAGCC GGCCGGTGCC GGCGGATGAG ATCGCCGGTT GGTTGTCCCG GGAGGGCGCG
GTTCAGGATC GCCGGTAG
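
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any length or composition check. The following is a minimal sketch of how the GC content field could be recomputed. The helper names are hypothetical, and only the first row of the sequence is used as sample input; pasting in the full sequence should yield a length of 2898 and a value that can be compared with the reported 63%.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence as shown on this page.
first_row = "ATGGACTACC TGATCCTTGC TTTGGGTGTT GCTTTCGTTC CGATCGCGCT GGCCTACGGG"
print(len(clean_sequence(first_row)))    # 60
print(round(gc_percent(first_row), 1))
```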
 
Protein sequence
MDYLILALGV AFVPIALAYG LLLKPKRAEA ALRRSAFHLR EAQQVANLGS WELDLVQGRL 
EWSDQVFRIF EIDPERFEAS YEAFLALVHP DDRERVDAAY RRSIADRGSY EIVHRLLMPD
GRVKHVRERC QTFYSEDGRP LRSVGTVWDV TSFHETEEKL HRLAAIVEST GDTVIVSDLQ
GRIRTWNKGA ERMWGYAAEE IIGQPVTVLP PLAKQGEPGQ ILRRVIDEHE VVRFESRGLH
KDGRDFPISL TVSPLFDAEN RLTGVSAIIR DITEMERLKA TLQERLKLVE MVFQYSVSCL
VVLDPQFNFI RVNPAYARAC RRDVAEFEGR NHFELYPSDA RFIFEEVVRT RRPFEAVARP
FTFPDQPERG VTYWDWNLVP VLDGKGEVEY LLFSLVDVTD RQKATAQLRL IETAFQHTRD
AVLITDARAR ILRVNPAFET ITGYSAGEVV GRTPRMLQSG RHDGAFYRRF WRALETEGHW
SGEIWNRHRD GHIFPAWQSV SAVKGPDGST THYVSIFTDI SEFKRAEAQI RYLSYHDDLT
GLPNRAYFQA RLDQLIEAAA RDRKQIALVV LDLDRFKTIN DSLGHSVGDE LLCQVAERIQ
DCTGQGDFVA RQGGDEFVVI LADSDAVRAA RVANAMISAI AKPAVAGGRT LIVTPSLGIS
LYPHDAGDSE SLIKSADAAM YHAKSLGRNT YQFFSAEMSA VATERLALEN ALHQALSNGE
FRLYYQPQIE VCSRRLIGLE ALIRWRHPEV GWVPPLRFIP LAEETGIIHR IGRWVLEEAC
RQQREWQAAG LCIVPVAVNL SPLQLQKDDF PEQVEALLSG CRLAPGLLEL ELTETAVMRD
VGRMSDMLRK LKERGTRFSI DDFGTGYSSL GYLNRFPVDK LKIDQSFIRD VTRREDHAAI
TRAIIALAKQ LHLKVVAEGV ETAEQLEFLE RAGCDGAQGF YFSRPVPADE IAGWLSREGA
VQDRR
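
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand rather than relying on a library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence on this page.
print(translate("ATGGACTACC TGATCCTTGC TTTGGGTGTT"))  # MDYLILALGV
print(translate("GTTCAGGATC GCCGGTAG"))               # VQDRR
```

The two fragments reproduce the first ten and last five residues of the protein sequence above; translating the full gene sequence the same way should give all 965 residues.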