Gene Information |
Locus tag | Mbar_A3733 |
Symbol | |
ID | 3625023 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 4818675 |
End bp | 4821878 |
Gene Length | 3204 bp |
Protein Length | 1067 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 637702565 |
Product | hypothetical protein |
Protein accession | YP_307175 |
Protein GI | 73671160 |
COG category | [S] Function unknown |
COG ID | [COG1520] FOG: WD40-like repeat |
TIGRFAM ID | |
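The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the helper names `gene_length_bp` and `protein_length_aa` are hypothetical and used for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Mbar_A3733.
start_bp, end_bp = 4818675, 4821878

length = gene_length_bp(start_bp, end_bp)
protein = protein_length_aa(length)

print(length)   # 3204, matching "Gene Length"
print(protein)  # 1067, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.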
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.422005 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.383912 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAA AAATGCGTCA ACTAATTTTG ATACCTATAG TAATGCTCGC AATATTGCAA AGTATACCAC TGGCATTATC TGAAGAGGAA AACAACTGGC CACAGTTTCA GAATAACATA GAACATACCG GTTTTTATGC AGGAGAATTA CCGGATGCAT TCCAGTTACA ATGGAAGAGT AGTGGAGTAG GCGCAGTTAT AGATTCCGCG CCTGTAACCG CTGAAAACAT GGTTTTTGTT ATAAGTAGGT CCAATTCATT GAAAGCTTTG AATGTTACAA CAGGTGATGT AGTATGGACT GCGAATACGG GAACAGATAA GTATGGTTCA TGGTCATCTC CTGCTTATGA TGATGGGATG GTGTTTGCTT CCAGAGGTAC TGACACGATT TGTGTTTATG CTTCAAACGG CACTGAAAAA TGGAAATTCA CTAATCCGAG TGGTCAAACA TCATGTAATG CCGGGCCATC AATAGCTAAT GGAAAAGTCT TTTGTTCTGA CTGGGGAGGA AGTCATTATT ATTGTGTGGA CGAGTATACT GGAAAACTCT TGTGGACTTA TTCAGTAATA GGTTCCGCAC AAGGTGTCCC TTCCTATAAA GACGGAAAGG TCTTCTTAAC AAGTGGGTAT GGTACAATAA ATGTGGGCTC GAATGTATAT GAAGGACAGG TATATTGTGT AGATGCAGAA AACGGCTCGA AAATATGGAC CACTCCAATT ACAGACAACG TATTATGTTC TGCCTCCGTA GGAAACAATG CTGTATATGT TACGTCTTAT GATTTCTACA CCGATAAAAC TACTGCATTA AGCGCGTTGG ATATAAATAA TGGGCAGATC CTCTGGGAAC AAAACATTCC AAGAACGGAC TCCACTCCTG CACTTGCATA TGGGAATGTT TACGTCTCTC CGGTATATTG TGTTAATGCT TCTTCAGGAG AAATTATCTG GAGCGGCAGT GCCGGGGGAT GGACGAATTC CATGACAGTT GCAGATGGAA AAGCTTTTGC AGGCAAAAAA ACTTCGTCAT ATGGTTATGA TCACATTGTG GAATATGATG CATATACAGG CGATGTTTTA TGGGAATCCA CGGCTGGAGG AGCTTGTTCT ATTGCAAATG GTAGTATATA TACAATTGGC GATGACGGCG AAGTCTATGC CTATGGGCAG GCTGATCCTT ATCTAGATAT GGTTGTCCAA AATGTTTCAG TGGTCGCAGG AAGCGTCTAC CCCTATTATC CGAATGAAAT TGCTGCAACT ATTAGAAATA ACGGCAACAC GTATGCTGAC AACGTATCCG TATCCTTCCT GGTAAATGGT GAACAGAAAG ATAACCTTGT AACAAAGATC GGAAGTAACC TGGCCAAAAA TGTGAGTTTT TCCTGGACAC CAGAAGCTCC AGGGGATTAT AACATAACGG TTGAAGCTCA TGCTACAGGG TCCGTTCCTG AAAATAATGA TGCAGATAAT TCTAGGGGAA TAAATGTAAC TGCTTTAGCT GGTGATGCAG ACCTTATCCC TGTATCGATC ACACCTTCTG CTATCTTTTC AAATACTTCA TATGAAATGA AAGCAGTTGT GAAAAACCAG GGTACTTCCA TGGCAAGTAA TTTTACAGTG ACCGTAAAAG AAGGAACAAA TGAACTGGCT GCAAAGACTT TCGAACAGCT TGGTCCTTCC CAGAGCGCTG AACTAAATTT TACATGGGAA TCCCAGGAGG CCGGAAACTT TGGGTTTACA GTATTTGCTG ATACGGAGAA CAATGTATCG GAAAGTGATG AAACTAATAA CCAGGTGACT 
CTGCCTATTA CGGTAAAACC GGAAACAGTA ATTGAAGCAA AATCTGCAAT ATACTGGACA CAATTTCAGG GGGGAAGTGA TAGAAATGGT GTTACTGAAG GATATGCACC CCTTGATGAT TCTGTTAAAC TTAAGTGGAG TGCGGATGAT TTTGGTGGGA ATATCGATCT TTGCCCAATA GTTGTAGGAG ATAACGTCTA TATACTAGCT TCCAGTGGAG AATTGTATGC TTATAACAAA GCAGAAGGTA AACCAATATG GCATGCAACA CTTGATGCTG CTTCGGTTTT ACACTCTTCA ACACCCGCAT ATGGAGACGG AAATCTCTTT GTATTAACTG AAGGTGGAAA TCTTTATGCC TATAACGCAA GTACAGGGGT CCAGAAATGG AAAGTACATG TAACGGATGT AGGTCCAGAA AGTCCGGTTA CGTATTACGA TCACAGGATA TACGTTGCAG AAGGTCTTGA AGGCGGGGTG GATACAAAAT ACTATTATTG CTACGATGAC CTTGGAAACC TTTTATGGAA ACATGCAACT CAGAACACAT CCGGCTTCAT ATGGAACGGT GCATCCGTTG TAGGAGATTA TCTGATATAT TCTACCCATG AAGGGAATCT CACATGTCTT GACAGGAAAA CAGGAGCGCT TGTCGATGAA ATAAGCCTGG ACAGCGATGT GTCGAGTCGG ATTTTATTTG CTCTCCCTGA GCCGGGAAGA TTCCGTTCCT CTGTCGCTTA CCATGGCGGG TATGTCTATA CGACTTCAGA GTTAGCACAG GAAACTGGTT ATGTCTGGAA AGTAGGATTC GATGACTCTA CTGGTACATT CCTTAACCAG GGATGGCGCT CTGACCAGAT GTTCAGTACC TCGACTCCTG CAATCTATAA CGGTAAAGTA TACGTTGGAC AGGGAGAGCA TGGATATGAT GGTAAGATGA TCTGTTTGGA TGATAGTGAT GGTAAAAAAG TATGGGAATA CCATGTGGAT GCAGGTGTGA AATCCTCTCC TGCAGTTTCT ACGTATTATG GAACACCTCG TATTTATTTC ACAACTGCAG AGGATAACGG TTCCCTTTAC TGCCTGAATG AATCAGGAGG TCTTGTGTGG GAATATAATC CTCCTGACGA TGGGTATATC CTGCAGGGAG TAGCACTTTC TCAGGGAAAA GCTTACTTTG GAACAGATGG AGGAAATCTA TACTGTGTTG AAGGAGATTG GAATGTTTTC AATGATCCAG ATTCGGAGTC CGGAGCGTAT ATTTCACTTG ACGAATTACA GACTGCCGTG CTTCATTGGA AAAAAGGTAT TTCAATAGAC TCTAACTACA AAATTTCACT AAACAATATA CAATCGATGG TTATGTACTG GAAAAACAAT TCGCCTATGA AGTTCAATAA GTGA
|
Protein sequence | MKQKMRQLIL IPIVMLAILQ SIPLALSEEE NNWPQFQNNI EHTGFYAGEL PDAFQLQWKS SGVGAVIDSA PVTAENMVFV ISRSNSLKAL NVTTGDVVWT ANTGTDKYGS WSSPAYDDGM VFASRGTDTI CVYASNGTEK WKFTNPSGQT SCNAGPSIAN GKVFCSDWGG SHYYCVDEYT GKLLWTYSVI GSAQGVPSYK DGKVFLTSGY GTINVGSNVY EGQVYCVDAE NGSKIWTTPI TDNVLCSASV GNNAVYVTSY DFYTDKTTAL SALDINNGQI LWEQNIPRTD STPALAYGNV YVSPVYCVNA SSGEIIWSGS AGGWTNSMTV ADGKAFAGKK TSSYGYDHIV EYDAYTGDVL WESTAGGACS IANGSIYTIG DDGEVYAYGQ ADPYLDMVVQ NVSVVAGSVY PYYPNEIAAT IRNNGNTYAD NVSVSFLVNG EQKDNLVTKI GSNLAKNVSF SWTPEAPGDY NITVEAHATG SVPENNDADN SRGINVTALA GDADLIPVSI TPSAIFSNTS YEMKAVVKNQ GTSMASNFTV TVKEGTNELA AKTFEQLGPS QSAELNFTWE SQEAGNFGFT VFADTENNVS ESDETNNQVT LPITVKPETV IEAKSAIYWT QFQGGSDRNG VTEGYAPLDD SVKLKWSADD FGGNIDLCPI VVGDNVYILA SSGELYAYNK AEGKPIWHAT LDAASVLHSS TPAYGDGNLF VLTEGGNLYA YNASTGVQKW KVHVTDVGPE SPVTYYDHRI YVAEGLEGGV DTKYYYCYDD LGNLLWKHAT QNTSGFIWNG ASVVGDYLIY STHEGNLTCL DRKTGALVDE ISLDSDVSSR ILFALPEPGR FRSSVAYHGG YVYTTSELAQ ETGYVWKVGF DDSTGTFLNQ GWRSDQMFST STPAIYNGKV YVGQGEHGYD GKMICLDDSD GKKVWEYHVD AGVKSSPAVS TYYGTPRIYF TTAEDNGSLY CLNESGGLVW EYNPPDDGYI LQGVALSQGK AYFGTDGGNL YCVEGDWNVF NDPDSESGAY ISLDELQTAV LHWKKGISID SNYKISLNNI QSMVMYWKNN SPMKFNK