Gene Slin_3886 details

Gene Information

Locus tag: Slin_3886
Symbol:
ID: 8727644
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4661090
End bp: 4662685
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: nickel-dependent hydrogenase large subunit
Protein accession: YP_003388675
Protein GI: 284038745
COG category:
COG ID:
TIGRFAM ID:
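
The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: the helper names `gene_length` and `protein_length` are hypothetical, and the sketch assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4661090, 4662685)
print(length)                  # 1596
print(protein_length(length))  # 531
```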


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.713091
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACTC AGAAAGAACT AAATATTTCG CCGGTAGGGC GGGTCGAGGG CGACCTCGAT 
GTGAAGGTAT ATATGGAAGA CGGTGTCGTG ACGCGGGCGC ATACACAGGC GGCCATGTTC
CGGGGCTTCG AGAAAATTAT GGAGGGCAAA GACCCGCAGG CAGGGTTGAT TGTAACCCCT
CGTATCTGCG GAATCTGCGG AGGGTCGCAC CTTTACTGCG CTTCGTCGGC GCTGGATACG
GCCTGGAAAA CGACCCTGCC GCCCAATGCG CTGTTGCTGC GCGCCATTGG GCAGGCTACC
GAAACCATTC AGAGTATCCC CCGCTGGTTC TACGCCATTT TCGCTACGGA CATGGCGAAC
AAAAAATTCG CCAACAAACC ACTCTACGAT GAAGTTGTGA AGCGGTTTGC GGCTTATGTA
GGAACGTCAT TTCAGCGGGG CGTTACCGCC AGCGGACGTC CGGTAGAGGT GTACGCGCTT
TTCGGTGGGC AGTGGCCGCA TTCGAGTTAC ATGGTGCCCG GTGGCGTCAT GTGTGCGCCT
ACGCTCAAAG ACATCACCCG CGCCCATGCC ATCATGAACC AGTTCCGGAA AGACTGGCTC
GAAACACTAT GGCTGGGCTG TTCCATCGAG CGATACATGC AGATCAAAAC CTGGGACGAC
CTAATGGCCT GGGTGGAGGA AAACGACTCG CAACGGAATT CAGACCTGGG CCTGTTTATC
CGGGCCAGTC TGGAGTTTGG CCTGCATAAG TTCGGGCAGG GTGTGGGCAA GTTCGTTGCG
TACGGAACGT ATCTGCACAA AGACCATTAC CAGAAACCAA CCGTAGAAGG CCGCAATGCC
GCACTGATTA GCCGAAGTGG CTTCTTCGAT GGCAGCAAGT ATCACGTTTT CGATCACCTG
AGCATTAAAG AACACGTGGG CCATGCCTGG TATAAGGATG TGCCGGCGGC TCACCCCTGG
GACGAGCCTA TGCCAACGCC CCTGCAGTCG CACACGCTGC ACGATTCGAA TTTCAATGAG
AAATATAGCT GGTCAAAAGC ACCCCGTTAC ATGGATATGG CGGCCGAAGC CGGACCGCTG
GCCCGCGTGA TCATGAACGC CAATCCGGAT AATCTGCTGC CGCATCAGGT CTACGATCCG
CTGTTTGGCG ATGTGCTGGA CAAAATGGGC GCTAACGTAT TTACGCGAAC GTTAGCCCGT
GTTCACGAAG CGGCCCGCCT GTACACCCAA ATTGATGAGT GGCTCCGGCA GATCGACCTC
AACAGCGAGT TCTATATCAA ACCGGAGGAG CGCGACGGCA AAGGGTTCGG CGCTACCGAA
GCGGCCCGTG GGGCGCTTGC CCACTGGATT GAAATTGAGA ACGGCGTTAT TAAAAACTAC
CAGGTCATGG CTCCTACAAC CTGGAACGTA GGACCCAACG ACGATCGGGG CAACCCCGGC
CCAATCGAAG CGGCTCTCGA AGGCACCGAA ATCGAGGACC CGCACGACCC CGTCGAAGTG
GGTATGGTAG CCCGTTCATT CGATTCCTGT CTGGTTTGTA CCGTCCACGC CCACGACGGC
AAATCGGGTG AGCAACTGGC CAAATTTAAA TTATGA
 
Protein sequence
MATQKELNIS PVGRVEGDLD VKVYMEDGVV TRAHTQAAMF RGFEKIMEGK DPQAGLIVTP 
RICGICGGSH LYCASSALDT AWKTTLPPNA LLLRAIGQAT ETIQSIPRWF YAIFATDMAN
KKFANKPLYD EVVKRFAAYV GTSFQRGVTA SGRPVEVYAL FGGQWPHSSY MVPGGVMCAP
TLKDITRAHA IMNQFRKDWL ETLWLGCSIE RYMQIKTWDD LMAWVEENDS QRNSDLGLFI
RASLEFGLHK FGQGVGKFVA YGTYLHKDHY QKPTVEGRNA ALISRSGFFD GSKYHVFDHL
SIKEHVGHAW YKDVPAAHPW DEPMPTPLQS HTLHDSNFNE KYSWSKAPRY MDMAAEAGPL
ARVIMNANPD NLLPHQVYDP LFGDVLDKMG ANVFTRTLAR VHEAARLYTQ IDEWLRQIDL
NSEFYIKPEE RDGKGFGATE AARGALAHWI EIENGVIKNY QVMAPTTWNV GPNDDRGNPG
PIEAALEGTE IEDPHDPVEV GMVARSFDSC LVCTVHAHDG KSGEQLAKFK L
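
The gene and protein sequences are printed in space-separated blocks of 10, so they need cleaning before use. The sketch below shows one way to strip that layout, compute GC content, and translate codons. The function names (`clean`, `gc_content`, `translate`) are hypothetical helpers, not part of any library. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code for elongation codons; alternative start codons are not handled here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; table 11 uses the same assignments
# as the standard code for elongation codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the blocks-of-10 layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = clean("ATGGCAACTC AGAAAGAACT AAATATTTCG CCGGTAGGGC GGGTCGAGGG CGACCTCGAT")
print(translate(first_line))  # MATQKELNISPVGRVEGDLD
```

Passing the full cleaned gene sequence to `translate` and `gc_content` would allow a comparison against the protein sequence and the GC content listed in the record.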