Gene Information |
Locus tag | Mpal_0431 |
Symbol | |
ID | 7271457 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosphaerula palustris E1-9c |
Kingdom | Archaea |
Replicon accession | NC_011832 |
Strand | + |
Start bp | 447192 |
End bp | 450092 |
Gene Length | 2901 bp |
Protein Length | 966 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643569076 |
Product | Carbohydrate binding family 6 |
Protein accession | YP_002465528 |
Protein GI | 219851096 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
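
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene span includes the terminating stop codon; neither convention is stated in the record itself. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Mpal_0431.
length = gene_length_bp(447192, 450092)
print(length, protein_length_aa(length))  # prints "2901 966"
```

Under these assumptions the result agrees with the Gene Length (2901 bp) and Protein Length (966 aa) fields.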
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.174739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATTC TGAGCCGTAC AGAGATCCGG AGGTCTGCCC TCCTCCTGGC TCTGATACTA TTATCTGGAT GCCTGCTGAT CCCAGGGGCA TTCGCAGTCA CGGTATCGAC CGGCACTTAT CAGATTGATG CTGTGGGAAA TACAACGACC ATCCCTATTA TGATGGATCA GGCACCCAAC GGATTCGCCT ATTATAAGAT CCAGATCAGT CTTACGAATC CGGATGTTGC AACGATCACA GATTGTTCAT TCCCAACCTG GCTGGGACTG AATCTGAATA CAGCACTCCC GGATAAGAGT GTGGTGATCC GGGGAGGAGA CCTTACCAAG GCCCATGTCA AAGCAGGGGA TACGAATATC CTCCTCGCTA CGCTGACTGT ACAGGGTACT GCAGTTGGGA CAACTCCGAT CACAGTGACA GTCTCTCCTC CAACCTATGT GGTTCAGGAT AAGGATGTCG TTAACTATGC AGTCACTACC CCTGCCGGTC AACTGACCGT CGGTACCCCG GTTACCCCTA CCCCGACGGT GCAGCCCCCG GTCGCTGACT TCTCGGTCGA CACGACTAAT GGAACAGCAC CGATGACGGT TCATATCACC GATAAGTCTG TGAACGCGAC CACGATCAAG TACAGTCTTG GGGATGGAAC ATCAACAGGT TCACTCTCTC CAGGAGGTGT CACCCCGTAC ACCTATGTTG CGGCAGGTAC CTTCACGCTT ACTCAGACCG CGACCAATGC AGCAGGGTCA ACCAACAAGA CTGTGACAAT AACGGTTTTG GGAGCACCGA CAACGACCCC AACGGTCACG CCTACCACTC CAACAGTGAC CCCTACAACT CCAACGGTTA CGCCTACCAC TCCAACAGTA ACCCCGACTA CTCCAACCGT TACGCCAACC CCGACGGTAA CTGTTACACC CGGCACCCTG ATTGCGGACT TCCAGGTGGC GACCACTCCA GGCAGCAACC TGGTCAAGAT CACTGATAAA TCGACGAACG CGTCGACGAT CAGGTACAAC CTCGGTGACG GAGTCTCCAC CAAGAATATG CCTCCGAACG GACCGGTGCT CTCCTACTAT TACATCACCG CGAACACCTA CACGATCACC CAGACTGCGG TCAGTGCGAC TGGACAGACC GCCACCAAGC AGGTGACGGT GACCGTCGGC GGCGCTCCAA CCATCACCAC AACGACGACG ACCACCACCA CAGCGACGAC CACGGTCACT CCGACCGTCA CTGTCACACC GGGGTATCCG GTGCCTGACT TCGCTCTCAC CACTCCAAGT TCGATGGGTA TCCAGATTGT CGACAACTCG GTGAACGCGA CCTCGGTGAA ATACGATCTC GGTGACGGAA CCACGACCAA ACTCACTCAG TTCCGGTACA CCTACTGGCA GGCCGGGACC TACACAATCA CGCTGACCGC AACCAATGCG ATGGGCTCCT CGGTGAAGAC GGTCCAGGTG ACTGTGCCGG CTACTGTACC GACGACCACC ACGCCCACAC CGACTGTTAC GGTTACATCG ACCGTTACAG CCACACCAAG CGGTCAACAG CCATACAACG GACCGCATAA CGCTCCTGGC AAGGTCGAGG CAGAGGACTA TGACCTCGGC GGCAACAACG TCGCCTACTA TGACACCACA TCAGGTAACG CAGGTGGGTT ATACCGCCAC GACGATGTTG ATATCGAAAT AGAGGATGGC CCAAGCGGCT ATGATGTCGG TTACATCATC GGCGGCGAAT ACCTCACTTA CTCAGTCGAT GCAGCAGCAG ACGGTGACTA CCCGATCACA 
CTCAAAGTAG CCAACCCTGA CGCAGCTGCA AAGACGGTGA CCGTCTCAAC TGGTGGTGCT TCGACCTCGG TCAGTATTTC ATCCACCGGC TCATTCGACA CCTACAACTC CTTCAACTCT GCAGGCACGC TTCATCTCGA GCAGGGCAGA AACATCGTGA AGGTGGACTT CGGTGCCTCA AGGATGAATT TCGATTACTT CACGATCGGG ACCGGTTCAC AGCCAACCAC AACTGTGACC ACAACAGCAA CGACAGCTCA GCCAACCACG ACTGTGACCA CCACAGCAAC GACGGCTCAG CCAACTGTGA CCACCACAGC GACGACGACT CAGCCAACCA CCACGACGGT CCAGCCGACC GGCACCCCGC TCTCGACCCC TTACTATATG ATCAGCATGG TTCCAGGGCA TATCGAGTCA TATTCCTATG ACAATGGCGG TGAAGGGGTC GCATATCATG ACACCACCAC AGCCAACCTC GGTGGAAAAC TCCGTTCAGA TGGCGTGGAT ATTGAGTACA GCCAGTCAGA TGCCGGCTAT AACATCGGGT ATGTTGCTGC TGGCGAATGG TTGATCTACT CGGTTCAGAT CGACACGGCA GGGACCTACA CGGCTGCCAT CCGGGCATCG AACCCAGATA GCACGGACAA GCACATCGTC CTCTCGCTCG AAGGGAGCAG CACTCCACTG GCGACCGTGA CCGTTCCGCC GACCGGTTCA TTCGATACAT ACCAGACCAT CACAACGACC GTACAGCTGC CTGCTGGCAG GCACTGGCTG AAGTTGAGCT TCCCAGAGAG CAGGGTGAAC CTCAGGTTTA TGGATATCAC CCGCGGGGCA TCGACCGGAC CGATCGCAGC GACCGGCACA CCTACCCTGC TCGTCTCAAC CACCCCAGTT GAGACCATTG TGCCTACCAC CCCGGCATCA GTGATCGACT CGATCACCAC CGCTACACCA CTGGTGACCA TCTCCCCAGT GATCACTACC CCAGTGATCA CTACTCCGGT GATCACTACT CCAGTGATCA CTACCCCTGT GATCACACAG GCGGCGGTCA TCACCCCAGA AGCAGAGAAT AACACCTCAG TCAAGGTCTG A
|
Protein sequence | MNILSRTEIR RSALLLALIL LSGCLLIPGA FAVTVSTGTY QIDAVGNTTT IPIMMDQAPN GFAYYKIQIS LTNPDVATIT DCSFPTWLGL NLNTALPDKS VVIRGGDLTK AHVKAGDTNI LLATLTVQGT AVGTTPITVT VSPPTYVVQD KDVVNYAVTT PAGQLTVGTP VTPTPTVQPP VADFSVDTTN GTAPMTVHIT DKSVNATTIK YSLGDGTSTG SLSPGGVTPY TYVAAGTFTL TQTATNAAGS TNKTVTITVL GAPTTTPTVT PTTPTVTPTT PTVTPTTPTV TPTTPTVTPT PTVTVTPGTL IADFQVATTP GSNLVKITDK STNASTIRYN LGDGVSTKNM PPNGPVLSYY YITANTYTIT QTAVSATGQT ATKQVTVTVG GAPTITTTTT TTTTATTTVT PTVTVTPGYP VPDFALTTPS SMGIQIVDNS VNATSVKYDL GDGTTTKLTQ FRYTYWQAGT YTITLTATNA MGSSVKTVQV TVPATVPTTT TPTPTVTVTS TVTATPSGQQ PYNGPHNAPG KVEAEDYDLG GNNVAYYDTT SGNAGGLYRH DDVDIEIEDG PSGYDVGYII GGEYLTYSVD AAADGDYPIT LKVANPDAAA KTVTVSTGGA STSVSISSTG SFDTYNSFNS AGTLHLEQGR NIVKVDFGAS RMNFDYFTIG TGSQPTTTVT TTATTAQPTT TVTTTATTAQ PTVTTTATTT QPTTTTVQPT GTPLSTPYYM ISMVPGHIES YSYDNGGEGV AYHDTTTANL GGKLRSDGVD IEYSQSDAGY NIGYVAAGEW LIYSVQIDTA GTYTAAIRAS NPDSTDKHIV LSLEGSSTPL ATVTVPPTGS FDTYQTITTT VQLPAGRHWL KLSFPESRVN LRFMDITRGA STGPIAATGT PTLLVSTTPV ETIVPTTPAS VIDSITTATP LVTISPVITT PVITTPVITT PVITTPVITQ AAVITPEAEN NTSVKV
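
The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch, the following shows one way to strip that grouping, compute a GC fraction, and translate a coding sequence. It assumes the amino acid assignments of NCBI translation table 11, which match the standard code for sense and stop codons. The helper names are hypothetical, and the GC figure in the record may have been computed differently.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (NCBI layout); '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned, non-empty nucleotide sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# The first two display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGAATATTC TGAGCCGTAC")
print(translate(prefix))  # prints "MNILSR"
```

The translated prefix matches the first six residues of the protein sequence listed in the record. Applying `translate` to the full cleaned gene sequence would be the corresponding check for the whole protein.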