Gene Mlab_1670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlab_1670 
Symbol 
ID4796062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanocorpusculum labreanum Z 
KingdomArchaea 
Replicon accessionNC_008942 
Strand
Start bp1703788 
End bp1705026 
Gene Length1239 bp 
Protein Length412 aa 
Translation table11 
GC content53% 
IMG OID640100360 
ProductTPR repeat-containing protein 
Protein accessionYP_001031098 
Protein GI124486482 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00502897 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000193878 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCCCATC ATCACGAAAA ACCGCCGAAA ACGAAATCGG CGAAGTACAT TACCATCGGG 
CTCATCACCC TCACGGTCAT CATGTTCATC GTCGCATTCC ACCCGTTTGG CACGATCATT
ACCACGCCGT TTCCAAAAGA CGAAGTGATC ATCGCGGTTT CCATGCCGTT TGAAGGAGAA
ATGGCAGAGT TCGGCATCGA ATATATGCGG GGAATCGAAC TCGCGGTCGA AGACATCAAT
GATGAAGGCG GCATCCGGGG CGTCCCGGTC CGGGTCGAAT ACTACAACAA CAAAGGAAAC
GTCACGCTCG CAAAAGCCCA GTTCAAAGCG ATCAAAGAAA GCGGAGTCCC AGTTGTGATC
GGCGCATTGA CAAGCACAGT GACGCTTGCC CTTGCCCCGT ATGCGGAGTC ATACGAGATC
GTTCTTATCT CTCCATCGGC AACATCTGCC GGTCTCTCGG CATACGGCAA TTACGTATAT
CGAACGGTCT CTTCCGACTT CTATCTCGGC GCCGGCATGG CGAAGATCAT CGGCGGCAGA
AACGAAACGC AGAATGTGAT GATGATAAGT CTCGACAACA GTTACGGAAA AAGTCTCAAA
TACGCCTTCA TGAACGAAGC CAACAGTTCC TATCCGGATA TGCATATCGT CTCCGCCATA
TCGGTCCCCG ACTCTAATAC GGTGAACACG ACCGAGATCA TTGCCGAAAT GAAGAAAACC
GACCCCCAGT CCGTTCTGCT GATCGTAAAC CCGAGCCAGT GCATAGAGAT CATGCTGGCC
GCAGAAAAGG AGGGACTCGA CCCGACCTGG TTTGGCTCGG ACATGGTGAC CAACCGGCAG
GTCCCTCAGG AAGTCGGTGA ATACTCGGAA GGCCTCATCG GTTTTTCCCA GGCGAGAAGG
ATCTCCGACC CCTCATACGA AGAGCATTAC GAAGAAACCT TCGGAGAAGC GATGATGACC
CGCGACTCGA TCTACGGATA CGACACGATG ATCGTGGTGT CCCAGGCAAT CGAACACAGC
GGATACACGG CGGACGGCAT CAGGGAAGGT CTCGACCTGA TCAGACATGT CGGCCTTACC
GGAACGATCG TCTTCGACGA AAAAGGAGAT GCCTATCCGT CGTATGATGT TATGCGGCTT
CAAAACGGCA AATGGGTCGA TCTACCGTGG AGAGAGGTCC TGACCTTCGA GAAGAAAGCG
GCTGCGATAT CCTCGGCTCA CGGCACCTCC TCCCACTGA
 
Protein sequence
MSHHHEKPPK TKSAKYITIG LITLTVIMFI VAFHPFGTII TTPFPKDEVI IAVSMPFEGE 
MAEFGIEYMR GIELAVEDIN DEGGIRGVPV RVEYYNNKGN VTLAKAQFKA IKESGVPVVI
GALTSTVTLA LAPYAESYEI VLISPSATSA GLSAYGNYVY RTVSSDFYLG AGMAKIIGGR
NETQNVMMIS LDNSYGKSLK YAFMNEANSS YPDMHIVSAI SVPDSNTVNT TEIIAEMKKT
DPQSVLLIVN PSQCIEIMLA AEKEGLDPTW FGSDMVTNRQ VPQEVGEYSE GLIGFSQARR
ISDPSYEEHY EETFGEAMMT RDSIYGYDTM IVVSQAIEHS GYTADGIREG LDLIRHVGLT
GTIVFDEKGD AYPSYDVMRL QNGKWVDLPW REVLTFEKKA AAISSAHGTS SH