Gene MCA1181 details

Gene Information

Locus tag: MCA1181
Symbol: cysA
ID: 3104525
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1235941
End bp: 1236987
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 68%
IMG OID: 637170361
Product: sulfate ABC transporter, ATP-binding protein CysA
Protein accession: YP_113646
Protein GI: 53804706
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1118] ABC-type sulfate/molybdate transport systems, ATPase component
TIGRFAM ID: [TIGR00968] sulfate ABC transporter, ATP-binding protein
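
The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the CDS ends with a single stop codon. The helper names are illustrative and are not part of any database API.

```python
# Values copied from the record above; helper names are hypothetical.
START_BP = 1235941
END_BP = 1236987


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))  # 1047
print(protein_length(1047))           # 348
```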


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.36202
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCATCG AAATCCGCAA CATCACCAAA TCCTTCGGCA GCTTCCAGGC CCTCAAGGGC 
ATCGACCTGA CCATCGGTTC CGGCGAACTG GTGGCCCTGC TCGGCCCCTC CGGCTGCGGC
AAGACCACGC TGCTGCGCAT CATCGCCGGG CTGGAGGCCG CCGACAGCGG CCAGATCCTG
CTCCACGGGG AAGACACGAC GCACCGCCAC GTCCGCGAGC GCCGGGTCGG CTTCGTGTTC
CAGCACTACG CCCTGTTCCG GCACATGAGC GTGTTCGAGA ACATCGCCTT CGGCCTGCGG
GTGCGCCCGC GCGGCCAGCG CCCGCCCGAA GCAGAAATCC GGCGGCGGGT GCAAGAATTG
CTGGAGCTGG TCCAGCTCGA CTGGCTGGCC GACCGCCATC CCGGCCAGCT CTCCGGCGGC
CAGCGCCAGC GCATCGCACT GGCCCGCGCC CTCGCCGTGG AACCGAAAGT CCTGCTGCTC
GACGAGCCGT TCGGCGCGCT GGACGCCAAG GTCCGCAAGG ATCTGCGCCG CTGGCTGCGG
CGCCTGCACG ACGGGCTGCA CATCACCTCG GTGTTCGTCA CCCATGACCA GGAAGAAGCG
CTGGAAGTCG CCGACCGGGT CGTCGTGCTG AACGCCGGCC AGATCGAACA GGTCGGCTCG
GCGGACGAGG TCTACGACCA TCCCGCCACG CCTTTCGTGT GCCAGTTCAT CGGCGACGTC
AACCTGTTCC ACGGCCGGGT GCACGGCGGC CGCGCCCTTA TCGGCGAGAC GGTGATCGAG
CTGCCGGACA TAGCGGAGTC GGACACCGAA AAGGCCTTGT TCTTCGCCCG TCCCCACGAA
ATCGAAATCG GCCGCGGCAC GGGCATCGGC GCCGTCGTCC GGGCCATCCG GCGGCGCGGC
AACGCGGTGC GGGTGGAGCT GGAGCGCAAG GATGGCAGGG GCGCCGTGGA AGCGGAACTC
AGCCGCGAAG CCTTCGGCCG CCACGCCATC AAGCACGGCG ACGAAGTGGT GATCCAGCCC
AGCAAGATCA GGATGTTTCA GCCCTGA
 
Protein sequence
MSIEIRNITK SFGSFQALKG IDLTIGSGEL VALLGPSGCG KTTLLRIIAG LEAADSGQIL 
LHGEDTTHRH VRERRVGFVF QHYALFRHMS VFENIAFGLR VRPRGQRPPE AEIRRRVQEL
LELVQLDWLA DRHPGQLSGG QRQRIALARA LAVEPKVLLL DEPFGALDAK VRKDLRRWLR
RLHDGLHITS VFVTHDQEEA LEVADRVVVL NAGQIEQVGS ADEVYDHPAT PFVCQFIGDV
NLFHGRVHGG RALIGETVIE LPDIAESDTE KALFFARPHE IEIGRGTGIG AVVRAIRRRG
NAVRVELERK DGRGAVEAEL SREAFGRHAI KHGDEVVIQP SKIRMFQP
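
The GC content and protein sequence can in principle be recomputed from the gene sequence. The sketch below is one way to do that in plain Python, under stated assumptions: the sequence blocks are pasted as text with their spaces and line breaks, only the bases A, C, G and T occur, and the function names (`clean`, `gc_content`, `translate`) are illustrative. The codon table uses the amino acid assignments of NCBI translation table 11; the table's alternative start codons are not handled, which does not matter for a CDS that begins with ATG.

```python
# Codon table in the conventional TCAG ordering; amino acid assignments
# follow NCBI translation table 11 (bacterial, archaeal and plant plastid).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]  # raises KeyError on non-ACGT bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCATCG AAATCCGCAA CATCACCAAA"))  # MSIEIRNITK
```

Passing the full gene sequence block to `gc_content` and `translate` should allow the listed GC content and protein sequence to be compared against recomputed values.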