Gene Mpal_0431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0431 
Symbol 
ID7271457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp447192 
End bp450092 
Gene Length2901 bp 
Protein Length966 aa 
Translation table11 
GC content56% 
IMG OID643569076 
ProductCarbohydrate binding family 6 
Protein accessionYP_002465528 
Protein GI219851096 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.174739 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATTC TGAGCCGTAC AGAGATCCGG AGGTCTGCCC TCCTCCTGGC TCTGATACTA 
TTATCTGGAT GCCTGCTGAT CCCAGGGGCA TTCGCAGTCA CGGTATCGAC CGGCACTTAT
CAGATTGATG CTGTGGGAAA TACAACGACC ATCCCTATTA TGATGGATCA GGCACCCAAC
GGATTCGCCT ATTATAAGAT CCAGATCAGT CTTACGAATC CGGATGTTGC AACGATCACA
GATTGTTCAT TCCCAACCTG GCTGGGACTG AATCTGAATA CAGCACTCCC GGATAAGAGT
GTGGTGATCC GGGGAGGAGA CCTTACCAAG GCCCATGTCA AAGCAGGGGA TACGAATATC
CTCCTCGCTA CGCTGACTGT ACAGGGTACT GCAGTTGGGA CAACTCCGAT CACAGTGACA
GTCTCTCCTC CAACCTATGT GGTTCAGGAT AAGGATGTCG TTAACTATGC AGTCACTACC
CCTGCCGGTC AACTGACCGT CGGTACCCCG GTTACCCCTA CCCCGACGGT GCAGCCCCCG
GTCGCTGACT TCTCGGTCGA CACGACTAAT GGAACAGCAC CGATGACGGT TCATATCACC
GATAAGTCTG TGAACGCGAC CACGATCAAG TACAGTCTTG GGGATGGAAC ATCAACAGGT
TCACTCTCTC CAGGAGGTGT CACCCCGTAC ACCTATGTTG CGGCAGGTAC CTTCACGCTT
ACTCAGACCG CGACCAATGC AGCAGGGTCA ACCAACAAGA CTGTGACAAT AACGGTTTTG
GGAGCACCGA CAACGACCCC AACGGTCACG CCTACCACTC CAACAGTGAC CCCTACAACT
CCAACGGTTA CGCCTACCAC TCCAACAGTA ACCCCGACTA CTCCAACCGT TACGCCAACC
CCGACGGTAA CTGTTACACC CGGCACCCTG ATTGCGGACT TCCAGGTGGC GACCACTCCA
GGCAGCAACC TGGTCAAGAT CACTGATAAA TCGACGAACG CGTCGACGAT CAGGTACAAC
CTCGGTGACG GAGTCTCCAC CAAGAATATG CCTCCGAACG GACCGGTGCT CTCCTACTAT
TACATCACCG CGAACACCTA CACGATCACC CAGACTGCGG TCAGTGCGAC TGGACAGACC
GCCACCAAGC AGGTGACGGT GACCGTCGGC GGCGCTCCAA CCATCACCAC AACGACGACG
ACCACCACCA CAGCGACGAC CACGGTCACT CCGACCGTCA CTGTCACACC GGGGTATCCG
GTGCCTGACT TCGCTCTCAC CACTCCAAGT TCGATGGGTA TCCAGATTGT CGACAACTCG
GTGAACGCGA CCTCGGTGAA ATACGATCTC GGTGACGGAA CCACGACCAA ACTCACTCAG
TTCCGGTACA CCTACTGGCA GGCCGGGACC TACACAATCA CGCTGACCGC AACCAATGCG
ATGGGCTCCT CGGTGAAGAC GGTCCAGGTG ACTGTGCCGG CTACTGTACC GACGACCACC
ACGCCCACAC CGACTGTTAC GGTTACATCG ACCGTTACAG CCACACCAAG CGGTCAACAG
CCATACAACG GACCGCATAA CGCTCCTGGC AAGGTCGAGG CAGAGGACTA TGACCTCGGC
GGCAACAACG TCGCCTACTA TGACACCACA TCAGGTAACG CAGGTGGGTT ATACCGCCAC
GACGATGTTG ATATCGAAAT AGAGGATGGC CCAAGCGGCT ATGATGTCGG TTACATCATC
GGCGGCGAAT ACCTCACTTA CTCAGTCGAT GCAGCAGCAG ACGGTGACTA CCCGATCACA
CTCAAAGTAG CCAACCCTGA CGCAGCTGCA AAGACGGTGA CCGTCTCAAC TGGTGGTGCT
TCGACCTCGG TCAGTATTTC ATCCACCGGC TCATTCGACA CCTACAACTC CTTCAACTCT
GCAGGCACGC TTCATCTCGA GCAGGGCAGA AACATCGTGA AGGTGGACTT CGGTGCCTCA
AGGATGAATT TCGATTACTT CACGATCGGG ACCGGTTCAC AGCCAACCAC AACTGTGACC
ACAACAGCAA CGACAGCTCA GCCAACCACG ACTGTGACCA CCACAGCAAC GACGGCTCAG
CCAACTGTGA CCACCACAGC GACGACGACT CAGCCAACCA CCACGACGGT CCAGCCGACC
GGCACCCCGC TCTCGACCCC TTACTATATG ATCAGCATGG TTCCAGGGCA TATCGAGTCA
TATTCCTATG ACAATGGCGG TGAAGGGGTC GCATATCATG ACACCACCAC AGCCAACCTC
GGTGGAAAAC TCCGTTCAGA TGGCGTGGAT ATTGAGTACA GCCAGTCAGA TGCCGGCTAT
AACATCGGGT ATGTTGCTGC TGGCGAATGG TTGATCTACT CGGTTCAGAT CGACACGGCA
GGGACCTACA CGGCTGCCAT CCGGGCATCG AACCCAGATA GCACGGACAA GCACATCGTC
CTCTCGCTCG AAGGGAGCAG CACTCCACTG GCGACCGTGA CCGTTCCGCC GACCGGTTCA
TTCGATACAT ACCAGACCAT CACAACGACC GTACAGCTGC CTGCTGGCAG GCACTGGCTG
AAGTTGAGCT TCCCAGAGAG CAGGGTGAAC CTCAGGTTTA TGGATATCAC CCGCGGGGCA
TCGACCGGAC CGATCGCAGC GACCGGCACA CCTACCCTGC TCGTCTCAAC CACCCCAGTT
GAGACCATTG TGCCTACCAC CCCGGCATCA GTGATCGACT CGATCACCAC CGCTACACCA
CTGGTGACCA TCTCCCCAGT GATCACTACC CCAGTGATCA CTACTCCGGT GATCACTACT
CCAGTGATCA CTACCCCTGT GATCACACAG GCGGCGGTCA TCACCCCAGA AGCAGAGAAT
AACACCTCAG TCAAGGTCTG A
 
Protein sequence
MNILSRTEIR RSALLLALIL LSGCLLIPGA FAVTVSTGTY QIDAVGNTTT IPIMMDQAPN 
GFAYYKIQIS LTNPDVATIT DCSFPTWLGL NLNTALPDKS VVIRGGDLTK AHVKAGDTNI
LLATLTVQGT AVGTTPITVT VSPPTYVVQD KDVVNYAVTT PAGQLTVGTP VTPTPTVQPP
VADFSVDTTN GTAPMTVHIT DKSVNATTIK YSLGDGTSTG SLSPGGVTPY TYVAAGTFTL
TQTATNAAGS TNKTVTITVL GAPTTTPTVT PTTPTVTPTT PTVTPTTPTV TPTTPTVTPT
PTVTVTPGTL IADFQVATTP GSNLVKITDK STNASTIRYN LGDGVSTKNM PPNGPVLSYY
YITANTYTIT QTAVSATGQT ATKQVTVTVG GAPTITTTTT TTTTATTTVT PTVTVTPGYP
VPDFALTTPS SMGIQIVDNS VNATSVKYDL GDGTTTKLTQ FRYTYWQAGT YTITLTATNA
MGSSVKTVQV TVPATVPTTT TPTPTVTVTS TVTATPSGQQ PYNGPHNAPG KVEAEDYDLG
GNNVAYYDTT SGNAGGLYRH DDVDIEIEDG PSGYDVGYII GGEYLTYSVD AAADGDYPIT
LKVANPDAAA KTVTVSTGGA STSVSISSTG SFDTYNSFNS AGTLHLEQGR NIVKVDFGAS
RMNFDYFTIG TGSQPTTTVT TTATTAQPTT TVTTTATTAQ PTVTTTATTT QPTTTTVQPT
GTPLSTPYYM ISMVPGHIES YSYDNGGEGV AYHDTTTANL GGKLRSDGVD IEYSQSDAGY
NIGYVAAGEW LIYSVQIDTA GTYTAAIRAS NPDSTDKHIV LSLEGSSTPL ATVTVPPTGS
FDTYQTITTT VQLPAGRHWL KLSFPESRVN LRFMDITRGA STGPIAATGT PTLLVSTTPV
ETIVPTTPAS VIDSITTATP LVTISPVITT PVITTPVITT PVITTPVITQ AAVITPEAEN
NTSVKV