Gene MCA1346 details


Gene Information

Locus tag: MCA1346
Symbol:
ID: 3104605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylococcus capsulatus str. Bath
Kingdom: Bacteria
Replicon accession: NC_002977
Strand:
Start bp: 1429614
End bp: 1430699
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 61%
IMG OID: 637170524
Product: sodium/bile acid symporter family protein
Protein accession: YP_113807
Protein GI: 53804544
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0798] Arsenite efflux pump ACR3 and related permeases
TIGRFAM ID: [TIGR00832] arsenical-resistance protein
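
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length(1429614, 1430699)
print(length, protein_length(length))  # 1086 361
```

Both values match the Gene Length and Protein Length fields of this record.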


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.17677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAGCGTAT TCGATCGCTA CCTGACTCTC TGGGTAGTTC TTTGCATCGT CGGCGGCGTT 
GGCCTGGGGC ATTTCTTCCC CGCGATGTTT CAGGCGGTGG GACGATGGGA AATCGCCAAG
ATCAATCTCC CGGTCGCAGC GTTGATCTGG TTGATGATCA TTCCCATGCT GCTCAAGATA
AACCTCAAAG CGCTGAAAGG CGTCGTGCAG CATTGGCGCG GAGTTGGTGT GACCCTGCTG
GTAAACTGGG GCGTGAAGCC GTTCTCGATG GCGTTCCTGG GATGGCTCTT CGTCGGCAAC
CTGTTTCGCC CATGGCTGCC GTCCGACCAG ATCGACGCCT ACATCGCCGG TTTGATACTC
CTGGCGGCAG CACCCTGCAC GGCCATGGTG TTCATCTGGA GCCATCTCGT GCGTGGCGAA
CCGCACTTCA CCTTATCACA GGTGGCACTC AACGACGTCA TCATGGTCTT CGCCTTCGCG
CCCCTCGTCG GCCTGCTGCT GGGACTGTCG GCCATTACTG TGCCCTGGGA CACATTGCTC
CTGTCGGTGC TGCTCTACAT CGTGGTGCCG CTGATGCTGG CGAACTTCGG GCGTGCCCTG
ATGTTGCGCC ACGAGCATGG GTATGCGCGC CTGGACCGAC TGTTGCAGGT TTTGCATCCC
GTGTCGCTGA CGGCACTCCT GGCCACGTTG GTGCTGCTGT TCGGCTTCCA GGGTGAGCAG
ATCTTGAGCC AGCCGTTGGT GATCCTTCTC CTGGCGGTGC CCATTCTGAT CCAGGTGTTT
TTCAATTCGG GGCTGGCCTA TCTCCTGAAC CGCGCCGTTC ACTCCCCCCA TTGCATCGCC
GGCCCCTCTG CGCTGATCGG CGCCAGCAAC TTCTTCGAAC TGGCGGTTGC CACCGCTATC
GCTCTGTTCG GTTTCGAATC CGGAGCGGCC CTGGCGACAG TAGTCGGTGT CCTGATCGAG
GTACCTGTGA TGCTGCTGGC CGTGGGCGTT GTCAACCGCA GTCGCGCCTG GTATGAGCTG
CGTTCAGGCG TTCCCAGCCA CGAAGAATGC TGCCCGGTGG ACGCGCATCG GCAGGCAGGC
AAATGA
 
Protein sequence
MSVFDRYLTL WVVLCIVGGV GLGHFFPAMF QAVGRWEIAK INLPVAALIW LMIIPMLLKI 
NLKALKGVVQ HWRGVGVTLL VNWGVKPFSM AFLGWLFVGN LFRPWLPSDQ IDAYIAGLIL
LAAAPCTAMV FIWSHLVRGE PHFTLSQVAL NDVIMVFAFA PLVGLLLGLS AITVPWDTLL
LSVLLYIVVP LMLANFGRAL MLRHEHGYAR LDRLLQVLHP VSLTALLATL VLLFGFQGEQ
ILSQPLVILL LAVPILIQVF FNSGLAYLLN RAVHSPHCIA GPSALIGASN FFELAVATAI
ALFGFESGAA LATVVGVLIE VPVMLLAVGV VNRSRAWYEL RSGVPSHEEC CPVDAHRQAG
K
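
The gene sequence begins with TTG, while the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of several permitted initiation codons and an initiating codon is read as methionine. The following is a minimal sketch of such a translation together with a GC-content calculation. The names `translate` and `gc_percent` are illustrative, the code is not the method used to produce this record, and it assumes the input contains only the unambiguous bases A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Table 11 uses the same
# assignments as the standard code; it differs in its set of start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove whitespace (such as the 10-base blocks above) and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS, reading a valid start codon as M, up to the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("TTGAGCGTAT TCGATCGCTA CCTGACTCTC"))  # MSVFDRYLTL
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison with the protein sequence and GC content listed in this record.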