Gene Information |
Locus tag | MCA1346 |
Symbol | |
ID | 3104605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylococcus capsulatus str. Bath |
Kingdom | Bacteria |
Replicon accession | NC_002977 |
Strand | - |
Start bp | 1429614 |
End bp | 1430699 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637170524 |
Product | sodium/bile acid symporter family protein |
Protein accession | YP_113807 |
Protein GI | 53804544 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0798] Arsenite efflux pump ACR3 and related permeases |
TIGRFAM ID | [TIGR00832] arsenical-resistance protein |
|
|
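The coordinate, gene length, and protein length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption that matches the listed 1086 bp length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1429614, 1430699)
print(length_bp)                  # 1086
print(protein_length(length_bp))  # 361
```

Both results agree with the Gene Length (1086 bp) and Protein Length (361 aa) rows of the record.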
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.17677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGCGTAT TCGATCGCTA CCTGACTCTC TGGGTAGTTC TTTGCATCGT CGGCGGCGTT GGCCTGGGGC ATTTCTTCCC CGCGATGTTT CAGGCGGTGG GACGATGGGA AATCGCCAAG ATCAATCTCC CGGTCGCAGC GTTGATCTGG TTGATGATCA TTCCCATGCT GCTCAAGATA AACCTCAAAG CGCTGAAAGG CGTCGTGCAG CATTGGCGCG GAGTTGGTGT GACCCTGCTG GTAAACTGGG GCGTGAAGCC GTTCTCGATG GCGTTCCTGG GATGGCTCTT CGTCGGCAAC CTGTTTCGCC CATGGCTGCC GTCCGACCAG ATCGACGCCT ACATCGCCGG TTTGATACTC CTGGCGGCAG CACCCTGCAC GGCCATGGTG TTCATCTGGA GCCATCTCGT GCGTGGCGAA CCGCACTTCA CCTTATCACA GGTGGCACTC AACGACGTCA TCATGGTCTT CGCCTTCGCG CCCCTCGTCG GCCTGCTGCT GGGACTGTCG GCCATTACTG TGCCCTGGGA CACATTGCTC CTGTCGGTGC TGCTCTACAT CGTGGTGCCG CTGATGCTGG CGAACTTCGG GCGTGCCCTG ATGTTGCGCC ACGAGCATGG GTATGCGCGC CTGGACCGAC TGTTGCAGGT TTTGCATCCC GTGTCGCTGA CGGCACTCCT GGCCACGTTG GTGCTGCTGT TCGGCTTCCA GGGTGAGCAG ATCTTGAGCC AGCCGTTGGT GATCCTTCTC CTGGCGGTGC CCATTCTGAT CCAGGTGTTT TTCAATTCGG GGCTGGCCTA TCTCCTGAAC CGCGCCGTTC ACTCCCCCCA TTGCATCGCC GGCCCCTCTG CGCTGATCGG CGCCAGCAAC TTCTTCGAAC TGGCGGTTGC CACCGCTATC GCTCTGTTCG GTTTCGAATC CGGAGCGGCC CTGGCGACAG TAGTCGGTGT CCTGATCGAG GTACCTGTGA TGCTGCTGGC CGTGGGCGTT GTCAACCGCA GTCGCGCCTG GTATGAGCTG CGTTCAGGCG TTCCCAGCCA CGAAGAATGC TGCCCGGTGG ACGCGCATCG GCAGGCAGGC AAATGA
|
Protein sequence | MSVFDRYLTL WVVLCIVGGV GLGHFFPAMF QAVGRWEIAK INLPVAALIW LMIIPMLLKI NLKALKGVVQ HWRGVGVTLL VNWGVKPFSM AFLGWLFVGN LFRPWLPSDQ IDAYIAGLIL LAAAPCTAMV FIWSHLVRGE PHFTLSQVAL NDVIMVFAFA PLVGLLLGLS AITVPWDTLL LSVLLYIVVP LMLANFGRAL MLRHEHGYAR LDRLLQVLHP VSLTALLATL VLLFGFQGEQ ILSQPLVILL LAVPILIQVF FNSGLAYLLN RAVHSPHCIA GPSALIGASN FFELAVATAI ALFGFESGAA LATVVGVLIE VPVMLLAVGV VNRSRAWYEL RSGVPSHEEC CPVDAHRQAG K
|
| |
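The gene sequence above is displayed in space-separated blocks of 10 nucleotides, begins with TTG (a valid alternative start codon under translation table 11, read as methionine at initiation), and ends with the stop codon TGA. The following is a minimal sketch of how such a field might be cleaned and sanity-checked. The helper names are hypothetical, the start codon set lists only the three most common bacterial start codons rather than every start permitted by table 11, and the demo string contains only the first two blocks of the record, not the full sequence.

```python
# Common bacterial start codons (a subset of those allowed by table 11).
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons in translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C nucleotides in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_common_start(seq: str) -> bool:
    """True if the first codon is one of the common bacterial start codons."""
    return seq[:3] in COMMON_START_CODONS


def has_stop(seq: str) -> bool:
    """True if the last codon is a table 11 stop codon."""
    return len(seq) >= 3 and seq[-3:] in STOP_CODONS


# First two display blocks of the gene sequence field, used only as a demo.
demo = clean_sequence("TTGAGCGTAT TCGATCGCTA")
print(len(demo), has_common_start(demo), gc_fraction(demo))
```

Applying `clean_sequence` to the complete Gene sequence field and passing the result to `gc_fraction` would give a value to compare against the GC content row, and `has_stop` could be used to confirm the terminal TGA.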