Gene MCA0899 details

Gene Information

Locus tag: MCA0899
Symbol:
ID: 3103922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 943502
End bp: 944518
Gene length: 1017 bp
Protein length: 338 aa
Translation table: 11
GC content: 64%
IMG OID: 637170092
Product: sulfate starvation-induced protein 2
Protein accession: YP_113385
Protein GI: 53804804
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
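
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The record does not state either convention, so both are assumptions, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 943502
END_BP = 944518
GENE_LENGTH_BP = 1017
PROTEIN_LENGTH_AA = 338


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 1017
print(coding_length_aa(GENE_LENGTH_BP))  # 338
```

Under these assumptions, both computed values agree with the gene length and protein length listed in the record.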


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.368441
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTCT CGATCAAGGC AGGCGCAGCC GCATCCCTGC TCGCGCTCGC AGCACTATTC 
GGCGCCGGCA CGGCCCCGGC CTCGGACCTG ACCCTGCTCA ACGTGTCATA CGATCCGACC
CGCGAGTTCT ATCAGGATTA CAACGCCGCC TTCGCCAAGC ACTGGAAGGA AAAAACCGGC
CAGACGCTCG AGATCCGCCA GTCGCACGGC GGCTCGGGCA AGCAGGCGCG GGCGGTGATC
GACGGACTGG ACGCCGATGT CGTCACCCTC GCTCTGGCCT ACGACGTGCA CCAGTTGCAC
GAGAAGCGCA AGCTGATCTC GGCGGACTGG CAGGCCAAGC TGCCCCACAA CAGCGCCCCC
TACACCTCCA CCATGGTATT CCTGGTGCGC AAGGGCAACC CTCTGGGCAT CAAAGACTGG
GACGATCTGG CCAAGACCGG CGTATCGGTA GTGACGCCCA ACCCGAAGAC CTCGGGCGGC
GCGCGCTGGA ACTACCTGGC CGCGTGGGGC TACGCCCTGA AGAAGTACGG CAACGAGCAG
GCGGCGCGGG ACCTGGTCGC GAAGATCTAC AAGAACGCCG CCGTGCTCGA CACCGGCGCC
CGCGGCTCGA CCATCACCTT CGCCGAACGG GAAATCGGCG ACGTGCTGAT CACCTGGGAA
AACGAGGCTT ACCTCATCCT GAAAGAGTAC GGCGCCGACA ACTTCGAGAT CGTCGCACCC
TCCATCAGCA TACTGGCCGA ACCCACGGTC ACCGTCGTCG ACGACATCGT CCGCCAGCGC
GGCACCGGCG ACGTCGCCAA AGCCTACCTG GACTACCTTT ACAGCCCGGA AGGCCAGGAA
TTGGCGGCCA AGCACCACTA CCGGCCACGC GACCAGGCGG TACTGGCCAG ACACGCCAAG
GATTTCGCCC CCATCCAACT GTTCACGATC GATGAATTGT TCGGCGGCTG GGGCAAGGCG
CAGAAGATCC ACTTCGCCGA CGGCGGCGTC TTCGACCAGA TTTACAGCGC CAAGTGA
 
Protein sequence
MKLSIKAGAA ASLLALAALF GAGTAPASDL TLLNVSYDPT REFYQDYNAA FAKHWKEKTG 
QTLEIRQSHG GSGKQARAVI DGLDADVVTL ALAYDVHQLH EKRKLISADW QAKLPHNSAP
YTSTMVFLVR KGNPLGIKDW DDLAKTGVSV VTPNPKTSGG ARWNYLAAWG YALKKYGNEQ
AARDLVAKIY KNAAVLDTGA RGSTITFAER EIGDVLITWE NEAYLILKEY GADNFEIVAP
SISILAEPTV TVVDDIVRQR GTGDVAKAYL DYLYSPEGQE LAAKHHYRPR DQAVLARHAK
DFAPIQLFTI DELFGGWGKA QKIHFADGGV FDQIYSAK
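
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes that the sense-codon assignments of translation table 11 match the standard genetic code (the two tables differ in which codons may initiate translation, which is not modelled here) and that the listed gene sequence is the coding strand read 5' to 3'. The helper names `clean`, `translate`, and `gc_fraction` are illustrative and do not come from the database.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; table 11 is assumed to share
# these codon-to-residue assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence as listed in the record.
first_line = "ATGAAACTCT CGATCAAGGC AGGCGCAGCC GCATCCCTGC TCGCGCTCGC AGCACTATTC"
print(translate(first_line))  # MKLSIKAGAAASLLALAALF
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and the GC content given in the record.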