Gene MCA1083 details

Gene Information

Locus tag: MCA1083
Symbol:
ID: 3104539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1138045
End bp: 1139916
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table: 11
GC content: 60%
IMG OID: 637170270
Product: hypothetical protein
Protein accession: YP_113556
Protein GI: 53804595
COG category: [R] General function prediction only
COG ID: [COG3211] Predicted phosphatase
TIGRFAM ID:
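
The length fields in this record agree with its coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is taken to be an untranslated stop codon. Both readings are assumptions here, not something the record states. A minimal Python sketch of that check, with illustrative variable names:

```python
# Coordinates copied from the record above.
start_bp = 1138045
end_bp = 1139916

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated,
# so the protein has one residue fewer than the number of codons.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the "Gene Length" field
print(protein_length, "aa")  # compare with the "Protein Length" field
```

Under these assumptions the computed values match the 1872 bp and 623 aa listed in the record.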


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.638894
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAAAA AAAGATTTCT GGCTACGGCG GTCGGTATGG CTCTGGCGGG TGCGGGGGCA 
TCCCCTGTCA TCGCCGCCGT CAAAGGCGCG ATTCCGGTGA AGTCAGTGGA GTTCATCGGG
ATGGCGGCGC CCGAAAACGC CGAACAAAGG GCTGCTGCCT ACACCAACGC CAAGGCGAAG
GTCACTTTCA AGAGCGGAGG GAGCCGGATC TACGATCTCA ATTACAACAC GCTCTTCTAT
AACACCGACA CCATTGGCGG CGTGACTGCC GGCGCGATCT ATGATGTGAC CGGTACGCCA
TTGCTGGATT CGAACGGCAA TCCGATGATC TCCGAAAGTC CGGACGCGAA TAGTCTGATC
CGCATCCCCG CCAGGGGCGG TGACAAGCTT TTTCTGGTTA CCCATTTCGA ATACGACTGG
CTGGACACTG CCGGCGTCGA TCAGTACGGC AAGCAGCCGA TGACCATGAG CCTGGCGAGC
ATCGCGCAGG ACGGGAAAAC CGGGGCGCTG GCGGCGACCG GCCTCAAGAA CATCGACATG
TCGGCCATCG ATGGCTTGTG GATTCCCTGT GCGGGCTCCC TCTCGCCCTG GAATACCCAT
CTCGGCAGCG AGGAGTATGA GCCGGACGCC CGTTGCCAAG TGGAAAGCTC CTGTGCAAGC
GGATCCATTG GTCTGGAGGG GATGGAGCGT TATCTTGCCG GGACCAAGGT GGCCAACGTG
TACAACTACG GCATCGTGCC TGAAGTGACA GTCGACGGAA GCGGCGCAAC CCGCGTCGTC
CGCCATCGGA CCCTGGGTCG CGTGTCCCGC GAGCTGGTTC AGGTGCTGCC CGATGAACGC
ACGGTGTTCC AGGGCGATGA CGGTACCTAT AACGTGCTGA CGATGTTTGT GGCGGACCGG
AAGCGTGATC TGTCTGTCGG GACGCTGTAT GCCGCGAAAT GGCAGCAGAT CAGCGCCGAG
AATGGCGGCG AAGCCGATCT GGCATGGGTC CGTCTCGGCC ATGCCAGCGA TGCCGAGCTG
GATTCGCTGG TGGCGAGCGG CGTGACCTTT GCCGACATCT TCGAAACCGC CCCGGTGAAC
AAACTGGCCG ATGGTTCCTA TGAAGCGCCT CCGGCAGGTT TCAGGCAGAT CATCGCCGGC
CACAACAAGG GGCTGGTGGA AAATCTGAAG CTGAAACCCG GGATGGAAAC CGCGGCGGCC
TTCCTGGAGA CACGCCGCTT CGCCGCGTAT ATGGGGGCCA CCACGGAATT CGAAAAATTC
GAGGGCGTGA CGGTCAACGC CCGCGACCGG AAAGTCTATC TGGCGATGAC GCGCCTGAGC
AGTGGCATGG AGGACAAACC GAAGGATCCG GCCAACCACA TCCGTATCCC CAAATTGCTG
GCAGGCGCGG TCTACCAGAT GGATCTCGCG ACCTGGAAGC TGGACACCGA AGGCCGCAGG
ATCGACAGCA AGTACGCAGG GACTCACATG AAGGCCTTGG TGCTGGGACA GGACATCGCC
AAGGATGCCG CGGGAAATAC CGCCGCTGTC GACAGGATCG CCAATCCGGA CAATCTGAAA
TATTCGGAAA AGATGCGTAC CCTGTTCATC GGCGAGGACA GTAGCACCCT CCATATCAAC
AACTTCCTGT GGGCCTACAA CGTCGATACC GGCAAGTTGT CACGCATCCT CTCCTTGCCC
GCAGGCGCGG AAAGCACCGG GCTTCAGGCG ATCGACGCCC TGGGCGGTCA TGCCTACATC
ATGAGCAACT ACCAGCATGC CGGCGATTAT TCATCCAACA TCGATCCGGC CCTCAAAGGA
CAACTCGAAC CGCTGATCGA CAAGTCCAAG GCGGCGATCG GTTATCTCGG AGGTATGCCG
GCATTGGAGT GA
 
Protein sequence
MMKKRFLATA VGMALAGAGA SPVIAAVKGA IPVKSVEFIG MAAPENAEQR AAAYTNAKAK 
VTFKSGGSRI YDLNYNTLFY NTDTIGGVTA GAIYDVTGTP LLDSNGNPMI SESPDANSLI
RIPARGGDKL FLVTHFEYDW LDTAGVDQYG KQPMTMSLAS IAQDGKTGAL AATGLKNIDM
SAIDGLWIPC AGSLSPWNTH LGSEEYEPDA RCQVESSCAS GSIGLEGMER YLAGTKVANV
YNYGIVPEVT VDGSGATRVV RHRTLGRVSR ELVQVLPDER TVFQGDDGTY NVLTMFVADR
KRDLSVGTLY AAKWQQISAE NGGEADLAWV RLGHASDAEL DSLVASGVTF ADIFETAPVN
KLADGSYEAP PAGFRQIIAG HNKGLVENLK LKPGMETAAA FLETRRFAAY MGATTEFEKF
EGVTVNARDR KVYLAMTRLS SGMEDKPKDP ANHIRIPKLL AGAVYQMDLA TWKLDTEGRR
IDSKYAGTHM KALVLGQDIA KDAAGNTAAV DRIANPDNLK YSEKMRTLFI GEDSSTLHIN
NFLWAYNVDT GKLSRILSLP AGAESTGLQA IDALGGHAYI MSNYQHAGDY SSNIDPALKG
QLEPLIDKSK AAIGYLGGMP ALE
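
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is a hedged illustration, not the database's own pipeline. It assumes the sequence is read 5' to 3' in frame from its first base, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in which codons may act as start codons). The helper names `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGATGAAAA AAAGATTTCT GGCTACGGCG GTCGGTATGG CTCTGGCGGG TGCGGGGGCA"
print(translate(first_line))  # MMKKRFLATAVGMALAGAGA
```

The 20 residues printed for the first line match the first two blocks of the protein sequence. Applying `translate` to the full gene sequence and `gc_content` to the same string gives a way to cross-check the Protein Length and GC content fields.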