Gene MCA1005 details

Gene Information

Locus tag: MCA1005
Symbol:
ID: 3103466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1054017
End bp: 1055648
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 66%
IMG OID: 637170191
Product: phosphate transporter family protein
Protein accession: YP_113482
Protein GI: 53804875
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0306] Phosphate/sulphate permeases
TIGRFAM ID:
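
The listed gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1054017
end_bp = 1055648

# Inclusive span of the gene on the replicon.
gene_length = end_bp - start_bp + 1

# Number of complete codons; the last one is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints: 1632 543
```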


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTACAGA ATGCTCGTGA GGCGGTGGCG GCGCCCATTG CGGCGGTAGT GGAAAAACCG 
AAGCTGGAAA CCCGCGCCGG ATCGGTCGGG TCGATCGTTT TCATGGGGCT TCTCGGCGTC
GGCGTCGCCT TCACGGCGTA CGGCCTGCAA AGCGACGTGG CCGGTGCCGG TACCGCGCTG
GCTTTGGCGC CTTTGCTGCT GCTCGGGCTG TCGCTCCTGA TCGCCCTGGG TTTCGAGTTC
GTGAACGGAT TCCACGATAC GGCGAATGCC GTCGCCACCG TCATCTACAC CCATTCGCTG
CCGCCGATGG CCGCGGTGGT GTGGTCGGGT GTCTGGAATT TCGTCGGCGT GCTGGTGTCG
TCGGGGGCCG TCGCCTTCGG CATCGTTTCC CTTTTGCCGG TCGAACTGAT TCTCCAGGTG
GGGAGCGACG CGGGCTTCGC CATGGTGTTC TCGCTGCTGA TCGCGGCCAT CTTGTGGAAT
CTCGGGACCT GGTATTTCGG CCTGCCGGCC TCCAGCTCGC ACACCCTGAT CGGCTCGATC
ATCGGCGTCG GGCTCGCCAA CGAGCTGATG TATCGGGGCG GGTCCGCCAC CTCGGGGGTG
GACTGGTCGC AGGCGGCGAA AGTGGGCCAA TCGCTCTTGA TGTCGCCACT CATCGGGGCC
GTCGCCGCCG GTCTGCTGCT GGTCGTACTC AAGTTCGTCG TCGCCAATCC CAGGCTCTAC
CGCGCTCCGG AAGGCTTGCG GCCGCCGCCG TGGTGGATAC GCGGCGTCCT GGTGCTGACC
TGCACCGGCG TCTCCTTCGC GCACGGCTCG AACGACGGGC AGAAAGGCAT GGGGCTGATC
ATGCTGATCC TGATCGGCAT CGTGCCGACC GCCTATGCCT TGAATCGGGC GGTGCCGGAC
TCCTACGTAC CCGAATTCCT GGCCTTGTCG CATGTCGTCG GGCAGGCACT GCTGGAAAAG
TCGGAAGGGG CGAAAGAATT CGACGGTGAT CCCCGGCCTG CCGTGACCGA CTTCATCCGG
ACCCGCCAAC CGGGGCCGGA TACGCTGGCG GCGACCGCGA AGCTGGTCGG GGAAATCGCC
GGCACCATCG AAAAAACCGG TTCGCTCGCC CACATTCCCG TGACCAAGAC CCAGAACGTC
CGCAACGACA TGTACCTCCT CGACGAGGCG ATGCGGCGGC TGGAAAAGCT GGGCCAGCCG
GTGTTCGACG AGCGCACGAG AAAGGAGATC GGCGGCTACC GCAAGGCGCT GGGGAACGCC
ACCCGCTTCA TCCCCACCTG GGTGAAGGTC GCCGTCGCCA TCGCCCTGGG CCTGGGCACG
ATGGTGGGCT GGCGGCGCAT CGTGGTGACC GTCGGCGAAA AGATCGGCAA GCAGCACCTG
ACCTACGGCC AGGGCGCGTC GGCGGAACTG GTGGCCATGA GCACCATCGC GGCGGCGGAC
ATGTACGGGC TGCCGGTGTC GACGACCCAC GTGCTGTCGT CAGGAGTGGC CGGGACGATG
GCGGCCAATG GCTCGGGTCT GCAGTGGTCC ACGATCCGCA ATCTGGCCAT GGCCTGGGTG
TTGACCCTGC CTGCCGCGGT CCTGCTTTCC GGGAGCCTCT TCTTCGTGTT TCACCAGATG
ATCGCCCATT GA
 
Protein sequence
MVQNAREAVA APIAAVVEKP KLETRAGSVG SIVFMGLLGV GVAFTAYGLQ SDVAGAGTAL 
ALAPLLLLGL SLLIALGFEF VNGFHDTANA VATVIYTHSL PPMAAVVWSG VWNFVGVLVS
SGAVAFGIVS LLPVELILQV GSDAGFAMVF SLLIAAILWN LGTWYFGLPA SSSHTLIGSI
IGVGLANELM YRGGSATSGV DWSQAAKVGQ SLLMSPLIGA VAAGLLLVVL KFVVANPRLY
RAPEGLRPPP WWIRGVLVLT CTGVSFAHGS NDGQKGMGLI MLILIGIVPT AYALNRAVPD
SYVPEFLALS HVVGQALLEK SEGAKEFDGD PRPAVTDFIR TRQPGPDTLA ATAKLVGEIA
GTIEKTGSLA HIPVTKTQNV RNDMYLLDEA MRRLEKLGQP VFDERTRKEI GGYRKALGNA
TRFIPTWVKV AVAIALGLGT MVGWRRIVVT VGEKIGKQHL TYGQGASAEL VAMSTIAAAD
MYGLPVSTTH VLSSGVAGTM AANGSGLQWS TIRNLAMAWV LTLPAAVLLS GSLFFVFHQM
IAH
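
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch of that translation follows. It uses the amino acid assignments shared by the standard code and translation table 11, ignores the table 11 rule that alternative start codons are read as methionine (this gene starts with ATG, so the rule does not apply here), and stops at the first stop codon. The function name `translate` and the table construction are illustrative, not taken from the source database.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
# The amino acid assignments are the same for the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used above) is ignored,
    and any trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks and the final line of the gene sequence above.
print(translate("ATGGTACAGA ATGCTCGTGA"))  # prints: MVQNAR
print(translate("ATCGCCCATT GA"))          # prints: IAH
```

The two fragments reproduce the first six and last three residues of the listed protein sequence. The final line can be translated on its own because every preceding line holds 60 bases, a whole number of codons.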