Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbur_0314 |
Symbol | |
ID | 3997470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanococcoides burtonii DSM 6242 |
Kingdom | Archaea |
Replicon accession | NC_007955 |
Strand | - |
Start bp | 289064 |
End bp | 292129 |
Gene Length | 3066 bp |
Protein Length | 1021 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637958145 |
Product | cadherin |
Protein accession | YP_565066 |
Protein GI | 91772374 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCATA ATAGATTCAG TAGATGGGGT AATAAAACTC TGATTGTGTT GGTTTTTGTA ATTCTTGTGG TACTCGGTAT ATCTGTACAA GCATCTGTTC CGAGTTTTAC AAATTTAGAC AATTCAACGA TCGAAGAGCA GAATATATCG ATCATGGACA ATGATCTTGC TATCACTAAT TCAGATGGGA CTGGTTATGG TGATGGGTAT ATTGAGTTTA TCCTTAGCAA TTCTAATTCT TATGATGATT TTGATCTAAA CAGTTCTTCT TCTCCGAATT CAAATGGTGA AATTTCCATC AATGGTAGTG ATGTTTTTGT TGGGACTGGC AGTGGAGTTG TTAAAATCGG TATCATTAAT GCAACATATG ATGGTCAGGA TGGTCAAAAA CTCCGAGTTG ATTTCGTAAC AGAAACAGCA AGTTTGCCCA CTAATAATAA TTTTGAAACC GGTAATATGG ATGGATGGGC TATCAATTCA AGTATATCTT CTATCCCGGA CACTCCTGAA CTAAGAGCAG TTATGTACGA AACTGTTGAC GGTAATCCTG ATGTTGGGGG GGGCTATGAT TTCAACAACA TATTGGATAG TGCTAGTCAA TTTGCAACTG CAGATACTGA TGCTAGTCAG GGGACCTACA GTTTGAAACT ATCCAATCGT GGGACTACTA ACCTGGGTTT TGGTTATATT TGGGGTCCTT CCGCAATAAG TTCAAACTTT TCAGCTGCTG CGGGGGATAG AATATTGTTT GATTGGATGG CAACTCAAAC AAGCGATTAT TATGTGGCCT ATGCAATACT GATTGACAAC ACTAACTCTG AGACCAGCAT ATTGTTTGCA GACCAGGGGT CAAGCAGCAC ATGGCAAACT CAGCAGGTAC TTGTAGCAAA GAACAGTTCT GATCTACAAT TTAAATTTAT TCTTGGTAGT TATGATAATA GTGGTGGGAG GGCAGTTGGT GCAACAATGC GTATTGATAA TATTCGAGCG CAGTTGTTGA CTGATGACAT CGTTAGTTCC TTAGCTCGTT CACTGACTTA CAATTACACT GACAGTAGTC CTTCAGGTGA TGTTATTTCA GGCAGAACCT ATACTATCAC AGTTAAAGAT GGGGATGGGG AACTTGCTTC TGGCACTTCT ACACTGTTTG TCTATGGCAA TGCTCCAGTG TTTAATTCTG CTGATAATTA TTCAGTAGCT GAAAATGAAA CGAGTTTGGC ATTAATGTTG TATGATACAC AAGCAGACAA TGGTGATGGT GGTCTAAATG ATTCCAATAT CACATATTTA TTTGCAGGAG GGGATGGACA AAGTCTGTTT GCCATCGACT CAGATGATGG TGAAATACGT CTGACTTCCT TGGGTACTGT CACTTTGGAC TATGAGGCAA AAACAAACTA TACTCTTCAG GTCCTTGTGA CCGATCAGCA AGCTGCTAAT AACACACTCA TTGAAAACGT CACAGTCAAT GTAATTGATG CCAATGACCC TGCAATAGCC GCGTCACCAC TCTCCCAGGG TGCAGTTAAT GGTACATACG TTCCTATGAT CCCTGAATTT AGCTGGACCT TCTCTGATAT CAATGCTGGC GATGGCCAGG CTGCTTATCA GTTGCTCGTA TCATCTTCAT CTTCAAATCT AACGTCCGAA AATGGTGATA TGTGGGATAC AGGTAAATTG ATGAGTAGCT CATCAAATAA CATTACTTAC AATGGTTCCT CTCTAAATGG GGGACAAACA TATTACTGGA AAGTAAAATT ATGGGATTCC TATGACGATA CGCGTTTTTA TTGCCCTGGA CAAACGTTTT CAACACGTGG TCCATCACTT ATCCTTGGAA ATGTTGTTAG CATTGAACAG GACTGGGGAA TCAATTTCAA CATAGATCAT TCTGTAACTA GCACTGAAAC AGATGCTGAC AATGTTAATG TTACTTATAA TGTTCCGTGG TTAACTGGTT CTTCCTTGGG ATCAATTAAC ACTGATGACT TGAAGTGGTG CAATCATACA TTGTTGAACT CAACGGTCGC AGAGAACGTC ATTAGGGTCT ATTCTAATAC TACACAAACC AGTGCATCCA ATGATTCTGA AGTATTCTAC ATAAACATCA CTAAACGAGA TATTGAGGTT GTAATTTCAC CTTCTTCTCA ATCTATTGCT CCAGGTGCTA CTATATGGGT AAGTGCAACT GTGGAAGGGG AATATGGTGA AACTTTCATT GGGAATGCAG TGTTCTTGAG GGATGGTATT GGAATTGGCA CAACACAAGC AGTTACTGAT GGGAATGCAA GTTTTAATAC AACAGAATCT TCGTTGGGAA CACACATCTT CTCGATCGAG TTCTATAATA CGACACTTTA CCACAACACT TCTGCTGTCT CCTTGGTCAC AGTAAGTGAT CCACAAACAA GTGATAATGG AGTAAGGGTT CATACAAAGT TAGGTCAACC TTCTGAAAAT GTTAGGTCAA CAGTTTCCGA ACTGAAACAT GTCATGGGTG GCAATAAGGT AGAGTTTACG TTCTCGGCTG GTGATGCTCC TGTCCTTGGG GTCAGCTTTG ATGCAAAAGA CAATGAAGGT GTTGTTGTTG CATCCGTTCA GGTGTTGAAT GAAGTTCCCG ATGACGTTGG GTCGCCATCT GGTATATCCT ATGAGTTCAT GAGCATTAAT GTTGGATCTG ATGGAATGAT CTCAGAACAC AATGCTGACA ATATCGTAAT CAACTTTAAA GTTAGTCGGG AATGGATCAA AGAAAATAAC ATTGATCCAT CTACAATAAG ACTTTCACGA TATCATGAGG TGTGGCAAGA TCTTCCTTCT TCACAGACCC GGGATGACGA TGAGTTCCTT TACTTCGTTG CTTATACTCC GGGCTTCTCA TTTTTCTCCA TTGTCGGAGA TGAAGTTGGA ACCAGTGTGT CTGAAGAGGA GGTCGTCATA ACTCCACAAC AGACAATAGA GGACGAAGAA CCGGTTGAAA AGGATAAACC AATACTTGCA GTATTTGGCA TAGCAATTGT ATTGGGAATG GCGACAGTGA TAGTAAGCAA AAGAAATAAA AAATGA
|
Protein sequence | MMHNRFSRWG NKTLIVLVFV ILVVLGISVQ ASVPSFTNLD NSTIEEQNIS IMDNDLAITN SDGTGYGDGY IEFILSNSNS YDDFDLNSSS SPNSNGEISI NGSDVFVGTG SGVVKIGIIN ATYDGQDGQK LRVDFVTETA SLPTNNNFET GNMDGWAINS SISSIPDTPE LRAVMYETVD GNPDVGGGYD FNNILDSASQ FATADTDASQ GTYSLKLSNR GTTNLGFGYI WGPSAISSNF SAAAGDRILF DWMATQTSDY YVAYAILIDN TNSETSILFA DQGSSSTWQT QQVLVAKNSS DLQFKFILGS YDNSGGRAVG ATMRIDNIRA QLLTDDIVSS LARSLTYNYT DSSPSGDVIS GRTYTITVKD GDGELASGTS TLFVYGNAPV FNSADNYSVA ENETSLALML YDTQADNGDG GLNDSNITYL FAGGDGQSLF AIDSDDGEIR LTSLGTVTLD YEAKTNYTLQ VLVTDQQAAN NTLIENVTVN VIDANDPAIA ASPLSQGAVN GTYVPMIPEF SWTFSDINAG DGQAAYQLLV SSSSSNLTSE NGDMWDTGKL MSSSSNNITY NGSSLNGGQT YYWKVKLWDS YDDTRFYCPG QTFSTRGPSL ILGNVVSIEQ DWGINFNIDH SVTSTETDAD NVNVTYNVPW LTGSSLGSIN TDDLKWCNHT LLNSTVAENV IRVYSNTTQT SASNDSEVFY INITKRDIEV VISPSSQSIA PGATIWVSAT VEGEYGETFI GNAVFLRDGI GIGTTQAVTD GNASFNTTES SLGTHIFSIE FYNTTLYHNT SAVSLVTVSD PQTSDNGVRV HTKLGQPSEN VRSTVSELKH VMGGNKVEFT FSAGDAPVLG VSFDAKDNEG VVVASVQVLN EVPDDVGSPS GISYEFMSIN VGSDGMISEH NADNIVINFK VSREWIKENN IDPSTIRLSR YHEVWQDLPS SQTRDDDEFL YFVAYTPGFS FFSIVGDEVG TSVSEEEVVI TPQQTIEDEE PVEKDKPILA VFGIAIVLGM ATVIVSKRNK K
|
| |