Gene Mhun_3042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_3042 
Symbol 
ID3922917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp3312649 
End bp3313935 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content49% 
IMG OID637898651 
ProductPKD 
Protein accessionYP_504448 
Protein GI88604270 
COG category[R] General function prediction only 
COG ID[COG3291] FOG: PKD repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.971034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCTTATG ACATGAAAGA TGTAAGAACT GCAATGGATC CGTCTGATAA CCACTATCAT 
TTCGTTCTTA TCATCATTGT TATATCACTC ATTATCGCTG CAGGAATATC TGTTTTATTT
ACCGCTGGAC CAGGTCCGGA ACAGAGCAGC AATACATCGC TTCTTCCTCA CATTTCCGAC
AGTTCCGGAC AGGACCTGAG TTCTGCTGAT AGCTGGATTG TACGAATTTT ATGGGTCGAT
CACCAGAACC GAACCTACTC TTTTCAGGAT TATCCACAAG AAACAAGACC GGATCCTGAT
GTGTATACCT TTCAAGGAGA TCCTCCTCCC GGGGCAGAGG CAGCCATGAT CCGGTTTCTC
TCCTGGGACG GGAGTACTGA ACGAACACAG GGAGCGCGCT ACATCATGGA CCAGGAGATG
GTAACATCTG TCCTGAACCG ATACAGCCTG ATGGCTTCCC TGTCAGCTCT GCCGGTTGCC
ACCCCTCTCG CTACCCAGAC TCCTGTTCCA TCCGGGACAC CGGTTCCGGG GATGCTGATT
CCTGAATCCG GAGTGGTATG CAGAAATGCC GATGGTTCTT ATACTGCCAG ATTCGGGTAT
ATATCCAGAC ATGATCACCC GGTTTCCCTT CCGGTCGGGG AATTAAATAT GTTTCATCCC
GGTTCTGCAG ACAGGGGTCA GCCGGTTGTT TTTCTGCCTG GAATTCACCA CGATGTCTTC
ACGATCACAT ACCCTGCTGA TGCAACCAAC CAGGCCTGGT CTTTGATGAA TAAACAAGTA
TCAGTCGGGA CGGTCCCTGA AGTAAATACC ATAATTACTA TCGAACCAGT ATCAGGATAT
GCGCCACTTG AGGTCAGATT CAGCCAGCGC TCAACCGGGT CGGTGACGAA CAATCCCCTG
TCAGGGACAT GGAATCTCGG AGATGGAACA ACAACGTCAG AATCAGATCC ATTCTTTCAC
CGGTATGAAA ACCCCGGACG ATATCTCGTC AGTTATACGG TCTCAAATCT TTGCAGCCAG
GCACGGGATA CGGGGGTCGT TGATGTGTAT CGTGCATCAT ATACCTGGGA AGATGATCCC
AATGATCCAG CCACCATCAG GTTTCATTCC ACATCCGGTG GGGATCCTGA CGTCTGGTTC
TGGGATTTTG GTGATGGATA TACCTCGTGG GAGGAGAATC CTGTTCACCG GTATCAGAAT
CCGGGAACCT ATAAAGTCGG TCTGACCTTG TCAGGAAAGC ATGGGAAAGG AACCGTAGTA
CATAGTATCA TTGTTCCTTC TTTATAG
 
Protein sequence
MAYDMKDVRT AMDPSDNHYH FVLIIIVISL IIAAGISVLF TAGPGPEQSS NTSLLPHISD 
SSGQDLSSAD SWIVRILWVD HQNRTYSFQD YPQETRPDPD VYTFQGDPPP GAEAAMIRFL
SWDGSTERTQ GARYIMDQEM VTSVLNRYSL MASLSALPVA TPLATQTPVP SGTPVPGMLI
PESGVVCRNA DGSYTARFGY ISRHDHPVSL PVGELNMFHP GSADRGQPVV FLPGIHHDVF
TITYPADATN QAWSLMNKQV SVGTVPEVNT IITIEPVSGY APLEVRFSQR STGSVTNNPL
SGTWNLGDGT TTSESDPFFH RYENPGRYLV SYTVSNLCSQ ARDTGVVDVY RASYTWEDDP
NDPATIRFHS TSGGDPDVWF WDFGDGYTSW EENPVHRYQN PGTYKVGLTL SGKHGKGTVV
HSIIVPSL