Gene Snas_3474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3474 
Symbol 
ID8884673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp3673937 
End bp3675235 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content67% 
IMG OID 
Productnickel-dependent hydrogenase large subunit 
Protein accessionYP_003512231 
Protein GI291300953 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.455292 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCATC GACATGACCG GTTGCTGCAC GTCGCGGCGC TGGCCCGCGT TGAAGGCGAA 
GGCGCGATGC GGGTCCGGGT CACCGGCGAC ACCGTCACCG AGGTCCAACT CGACATCTAC
GAACCACCCC GGTTCTTCGA AGCCCTGCTG CGCGGACGCG GCCACACCGA ACCACCCGAC
ATCACCTCCC GCATCTGCGG AATCTGCCCG GTGGCGTACC AGATGAGCGC GTGCGCGGCC
ATTGAGGACG CGTGCGGCAT CACGGTGGAC AGTGGCATCC AAGACCTGAG AAGGCTGCTG
TACTGCGGCG AATGGATATC GAGCCACGCG CTGCACATCT ACCTGCTGCA CGCTCCCGAC
TTCCTGGGGT ACTCCGGGGC CGTCGACATG GCGCGTGACC AACGCCCCGT CGTCGAGCGG
GGACTGCGGC TGAAGCAGGC GGGAAACATG ATCATGGAAC GCCTCGGGGG CCGGGCGATT
CACCCGGTGA ATGTCCGGAT CGGCGGTTTC CACCGGCTGC CCACGCGCAC CGAGCTGGAT
TCCCTCGTGG CTCCGTTGAC CCGGGCGCTC GACGACGCAC TCGACACGGT GACCATGGCC
GCCGGTTTCG ACTTCCCCGA GTACGAGTGC CGACATGAAT GGCTGGCCCT GGTCGACCCG
CGACAGCGCT ATCCCATCGA CGGCGGCGTG CCACACACCT CGCAAGGCTC GTTTCCGTTG
CGCGAGTACA CCTCCCACGT CGTCGAGCAC CAGGTGTCCC ACTCCACCGC GTTGCACGCG
CGGCTGGCGG ACGGTTCGCG ACCGCTGACC GGCCCCCTCG CCCGATACGC GCTCAACCAC
GACCGGCTGT CGCCCCTGGC CCGGCAGACG GCGCGCTCGG CGGGTCTGGA TTCCGTGTGC
CGCAACCCGT TTCGCAGCAT CATCGTGCGG GCGGTCGAAA CCGTGTACGC GGTGGAGGAG
GCGCTGCGTA TCATCGCCTC CTACGAGCCC CCACCCCGCC CCGCCGTCCC GGTGCCGCCG
GTGGCCGCCA TCGGATACGG TGCCACGGAG GCACCCCGCG GCGTGCTCTT CCATTCCTAT
ACATTGGACG ACAGTGGAAC CGTGCTGGCG GCGAACATCG TGCCGCCAAC GGCCCAGAAC
CAGACCGCGA TGGAACACGA CCTACGGGGT TTCGTCCAGG ACCACCTCAC CTTGGACGAC
CACAGCCTCA CCCATGCCTG CGAACAGGCG ATCCGCAACT ACGACCCGTG CATCTCGTGC
AGCACCCACT TCCTCGACCT GACGGTGGAA CGGGGCTGA
 
Protein sequence
MTHRHDRLLH VAALARVEGE GAMRVRVTGD TVTEVQLDIY EPPRFFEALL RGRGHTEPPD 
ITSRICGICP VAYQMSACAA IEDACGITVD SGIQDLRRLL YCGEWISSHA LHIYLLHAPD
FLGYSGAVDM ARDQRPVVER GLRLKQAGNM IMERLGGRAI HPVNVRIGGF HRLPTRTELD
SLVAPLTRAL DDALDTVTMA AGFDFPEYEC RHEWLALVDP RQRYPIDGGV PHTSQGSFPL
REYTSHVVEH QVSHSTALHA RLADGSRPLT GPLARYALNH DRLSPLARQT ARSAGLDSVC
RNPFRSIIVR AVETVYAVEE ALRIIASYEP PPRPAVPVPP VAAIGYGATE APRGVLFHSY
TLDDSGTVLA ANIVPPTAQN QTAMEHDLRG FVQDHLTLDD HSLTHACEQA IRNYDPCISC
STHFLDLTVE RG