Gene MCA1443 details

Gene Information

Locus tag: MCA1443
Symbol:
ID: 3103404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1535365
End bp: 1536966
Gene Length: 1602 bp
Protein Length: 533 aa
Translation table: 11
GC content: 62%
IMG OID: 637170618
Product: sulfate transporter family protein
Protein accession: YP_113900
Protein GI: 53804196
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0659] Sulfate permease and related transporters (MFS superfamily)
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00222579
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCAAAA AGCATCTCGC GTACTATCTC GCACATCTCA CCCAGGACAT CCCCGCCGGC 
ATCGTGGTGT TCCTGGTGGC GCTCCCTCTC TGTCTGGGTA TTGCGCTGGC GTCCGGCGCG
CCGCTGCTTT CAGGCGTGGT TGCCGGCATC ATCGGCGGAC TGGTTGTCGC TTGGGCCAGC
GGTTCGCAGT TGAGTGTGTC GGGCCCGGCC GCGGGCCTGA CGGTCATCGT GCTGCAGGGT
ATCGAAAAGC TCGGCGGCTT CGAGCACTTT CTGCTCGCCC TGATCCTGGC GGGGCTGATG
CAGCTCGCTC TGGGCTTCCT CAAGGCGGGA ACCATCGGCG CCTATTTCCC GTCGTCCGTC
ATCAAGGGCA TGCTTTCGGC GATCGGCCTG ATCTTGATTG CCAAACAGCT CCCCCACGCC
GTCGGCTATG GCCGGGACAT CCTGGGCGAG GAAACCTATC TGCCTCAGGA CACGGAGGGC
ACTTTCTCGG AACTGATGCA CGCCATGGAC TCGATTTCCC CGGGCGCGAC CGTCGTGAGC
GCCATTGCCA TCGTCATCAT GGTGCTGTGG GAGTCCCGGC TTGTCCGTGC CGTCCCGTTG
CTGGCGTGGA TTCCGGGGCC GCTGGCGGCG ATTGCCTGGG CGGTGGCGTT CAATCTCTCG
ATGGCGGGAT CGTCTTGGGA GATCGCGCCG GAGCACATGG TCCAGCTTCC AGACATCCGG
GGAGTCGGTG ATCTGGCCGG ACGGCTCGTC TTTCCGGATT TCAGCCGGAT CATGGATCCG
GCGGTGTACA GGGTGGCGTT CACGATTGCC GTCATTGCCA GCCTCGAAAC CCTGCTCAGC
TTGGAAGCAG TCGACAAGCT CGACCCCCTG AAGCGGGTGG CGCCGACCAA CCGGGAGCTG
AAGGCACAGG GTATCGGCAA CCTGTTGGCC GGCTTGCTGG GCGGTCTACC GCTGACGGCG
GTGATTGTGC GCTCGTCGGC CAACATCAAT GCCGGCGGGC GTACCAAGGT GGCCTGCTTC
ATCCACGGCC TTCTACTGCT GGTAAGCGTG AGCTTCCTCG CCCGCTACCT CAACCACATC
CCTCTGGCCG TGCTGGCCGC GATCCTGCTG ATGACGGGCT ATAAACTGGT CAAGCCGGCC
TTGGTGGCAG AGATGTACCA CAAGGGCGTG AGCCAGTTTA TTCCGTTCGC CGTCACGGTC
ATGGCGATAC TGGCCACCGA CCTGCTCATA GGGATCGCCG TCGGCATCGC CTGTGGCCTA
TACTACGTCG TCCGAGCGAA TTTCCATGCC GCCATTTCTC TCACCCGCCA TGGGAACCAC
TATCTGCTGC GACTGCGCAA AGACGTCTCG TTCCTGAACA AGGCACTGTT GCGCGAGCAA
CTGGACCAGG TCGAACCCGA CAGCGAGCTG ATCATCGACG GCACCTACGC GGAATTCGTG
GATCAGGACA TCCTCGAAAC CATCGAGAAC TTCGTCGAAG CCGCGCGCGA CGACCGTATC
GTGGTTTATC TGAAAAATTT CAAAGCCGCG TACGCCGGCA GCACCGCCAA GAACCGGCAA
GACAACACCG ATACCACCCC TTTGTACAGG ATTTCCGGCT GA
 
Protein sequence
MVKKHLAYYL AHLTQDIPAG IVVFLVALPL CLGIALASGA PLLSGVVAGI IGGLVVAWAS 
GSQLSVSGPA AGLTVIVLQG IEKLGGFEHF LLALILAGLM QLALGFLKAG TIGAYFPSSV
IKGMLSAIGL ILIAKQLPHA VGYGRDILGE ETYLPQDTEG TFSELMHAMD SISPGATVVS
AIAIVIMVLW ESRLVRAVPL LAWIPGPLAA IAWAVAFNLS MAGSSWEIAP EHMVQLPDIR
GVGDLAGRLV FPDFSRIMDP AVYRVAFTIA VIASLETLLS LEAVDKLDPL KRVAPTNREL
KAQGIGNLLA GLLGGLPLTA VIVRSSANIN AGGRTKVACF IHGLLLLVSV SFLARYLNHI
PLAVLAAILL MTGYKLVKPA LVAEMYHKGV SQFIPFAVTV MAILATDLLI GIAVGIACGL
YYVVRANFHA AISLTRHGNH YLLRLRKDVS FLNKALLREQ LDQVEPDSEL IIDGTYAEFV
DQDILETIEN FVEAARDDRI VVYLKNFKAA YAGSTAKNRQ DNTDTTPLYR ISG
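
The sequences above are displayed in blocks of 10 characters, and the record's lengths are related by simple arithmetic: an unspliced 1602 bp CDS ending in a stop codon encodes 1602 / 3 - 1 = 533 residues. The following is a minimal Python sketch, not part of the original record, of how the displayed blocks might be cleaned up and checked. The function names clean_sequence, gc_percent, and expected_protein_length are hypothetical helpers introduced here for illustration only.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon.

    Assumption: the CDS length includes the stop codon, as it does for this gene
    (the gene sequence ends in TGA).
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First display line of the gene sequence above: six blocks of 10 bases.
first_line = "ATGGTCAAAA AGCATCTCGC GTACTATCTC GCACATCTCA CCCAGGACAT CCCCGCCGGC"
print(len(clean_sequence(first_line)))   # 60
print(expected_protein_length(1602))     # 533
```

Passing the full gene sequence through clean_sequence and then gc_percent would give a value to compare against the GC content field listed under Gene Information.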