Gene Mhun_3090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMhun_3090 
Symbol 
ID3921967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanospirillum hungatei JF-1 
KingdomArchaea 
Replicon accessionNC_007796 
Strand
Start bp3366840 
End bp3367820 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content47% 
IMG OID637898700 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_504496 
Protein GI88604318 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID[TIGR03589] UDP-N-acetylglucosamine 4,6-dehydratase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.196445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.352446 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTCT TCGACAATAA AACCATACTC ATAACCGGCG GAACCGGATC TTTCGGCAAC 
GCGTTCACCT CCCGCCTTCT CAATAACCAT AATCCGCATA GTATCCGGAT CTACTCGCGT
GGAGAATATC TCCAATGGAA GATGCAGCAA AAGTTTTCCG ACAGCCGCAT CAGGTTTTTC
ATTGGTGATA TCCGAGATAA AGCCCGTCTC ACTCGCGCCT TGAATGATGT CGATATCGTG
GTTCACGCAG CAGCGCTTAA ACAGGTTCCA GCATGTGAGT ATAATCCGAT TGAAGCCGTG
AGAACCAACA TCGACGGGAC AACAAACCTT ATAGATACAT CAATTGACAA CAATGTCGAC
CGGCTCATAG CCCTGAGCAC TGACAAGGCT GTTCACCCGG TCAATCTCTA CGGCGCAACA
AAGATGGTAG CGGAGAAACT GTTTATCCAG GGGAATGCAT ATTCAGGTAA GAAAACAACC
CGGTTTTCCT GTGTCAGGTA TGGAAATGTG GTTGGAAGCA GAGGAAGCAT CGTTCCCTTA
TTTAAGATGC AAAAAGAAGA GGGAAAGATT ACTATAACTG ATCCCCGTAT GACCAGGTTC
TGGCTTACCC TGGACCAGGG TGCAGCCTTT GTTGAAAATT GTACCCAGAT TATGAATGGA
GGAGAGATAT TTGTGCCCAA GATCCCCAGC ATGAAGATCA CCGACCTTGC AGAGGCTATA
GCTCCTGGTA TTCCCCATGA GTATATCGGC ATCAGACCTG GAGAAAAGAT CCATGAAGTT
CTTATTACCG AAGATGAAGC CCGCCATACC CGCGATCTTA AAGAATACTA TATTATAGAT
CCGGAGATAT CGTTCTGGAA CGGGCATAGA AAGGATTATT CTTACACACT CCCTGAAGGG
TTCCGGTATT CCAGCGAGAC CAATACCGAA TGGCTGGATG AAGAGGGATT AAAGCAGATG
CTTGCTGAAT CCCACCCATA A
 
Protein sequence
MSFFDNKTIL ITGGTGSFGN AFTSRLLNNH NPHSIRIYSR GEYLQWKMQQ KFSDSRIRFF 
IGDIRDKARL TRALNDVDIV VHAAALKQVP ACEYNPIEAV RTNIDGTTNL IDTSIDNNVD
RLIALSTDKA VHPVNLYGAT KMVAEKLFIQ GNAYSGKKTT RFSCVRYGNV VGSRGSIVPL
FKMQKEEGKI TITDPRMTRF WLTLDQGAAF VENCTQIMNG GEIFVPKIPS MKITDLAEAI
APGIPHEYIG IRPGEKIHEV LITEDEARHT RDLKEYYIID PEISFWNGHR KDYSYTLPEG
FRYSSETNTE WLDEEGLKQM LAESHP