Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A3307 |
Symbol | |
ID | 3626801 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | + |
Start bp | 4246554 |
End bp | 4247675 |
Gene Length | 1122 bp |
Protein Length | 373 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637702142 |
Product | hypothetical protein |
Protein accession | YP_306767 |
Protein GI | 73670752 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.501382 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.0574639 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAAA TTATTATTTG GATATTGATC ATATTCTGCC TGGTGCAGTC CGCTATTTTT TCCGGAATGA CAATTGGACT TTTCAGCCTT GGTAGACTCA GGCTTGAAAT TGAAGCCGAA GCAGACAGCA AAGATGCTAT CAAAATTCTG CAGATCCGGC GGGACTCAAA CTTCCTGCTT ACGACACTGC TCTGGGGAAA TGTAGGCATA AATGTCCTGA TTGCCCTACT TACAGGTTCC GTGCTGACAG GAGCCTCAGC TTTCCTCTTC TCTACTTTTG TAATCACCAG TTTTGGAGAG ATTGTACCCC AGGCTTATTT TTCCCGAAAT GCCCTTTCAA TTGGAGCAAA ACTAACTCCT TTAGTCCGGT TCTACCAGAT GCTGCTCTAT CCGGTAGCCA AGCCTACGGC CCTTATTCTT GACTGGTGGC TCGGCAGGGA AAAACTTGAA CTCTTCAAGG AACAGTCCAT GCGGATTATG CTCGAAAAGC ATATTGAGTC GGGAAAGTCT GATATTGGCA CTTTTGAGGG AATAGGGGCT CTGAACTTTC TCTCCATAGA CGACGTCAGT ATCTCCGATG AAGGCTCGCT AATAGACCAG AGAAGCATAA TCTCACTCCC GGTTGAAAAT AACCGTCCGG TATTTCCTCC TTTCAAAAGA GAACCAGAAG ATCCTTTTCT GCAAAAGATA GAAGCCTCCG GAAAAAAATG GGTAATCATT ACCAACCCTC AGGACGAGCC TGTCATGGTG CTTGACGCGG ACGGCTTCCT AAGGGATGCA GTCTACAAGA AAGGCCCATT TATTCCGCTT TCTTACTGCC ACTTCCCGGT TGTGGTGAGA TCTCCCAAAA CCAGGCTTGA GAAAGTAATC CGGCAGTTTA AGGTGTATCC GCAATACCCT GAAGATGACG TGATCGATCA GGATCTTATC CTCTACTGGG ACCAGGAGAA AAGAATTATT ACGGGTTCGG ACATTCTGGG CCGGCTGCTA CGGGGAATTG TAGTGGAGTG TGACCTGAAA TCAGGGTGCG AGACGCCTGT TCCGCCTTCC CAGCCTGGAG TTGTCAGAAG AAGTTTGAGA AGAGGAAAGA AGAAAGAAAG CGAAGAGCAG AAAAAAGAAT GA
|
Protein sequence | MNEIIIWILI IFCLVQSAIF SGMTIGLFSL GRLRLEIEAE ADSKDAIKIL QIRRDSNFLL TTLLWGNVGI NVLIALLTGS VLTGASAFLF STFVITSFGE IVPQAYFSRN ALSIGAKLTP LVRFYQMLLY PVAKPTALIL DWWLGREKLE LFKEQSMRIM LEKHIESGKS DIGTFEGIGA LNFLSIDDVS ISDEGSLIDQ RSIISLPVEN NRPVFPPFKR EPEDPFLQKI EASGKKWVII TNPQDEPVMV LDADGFLRDA VYKKGPFIPL SYCHFPVVVR SPKTRLEKVI RQFKVYPQYP EDDVIDQDLI LYWDQEKRII TGSDILGRLL RGIVVECDLK SGCETPVPPS QPGVVRRSLR RGKKKESEEQ KKE
|
| |