Gene Slin_5804 details


Gene Information

Locus tag: Slin_5804
Symbol:
ID: 8729579
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 7047036
End bp: 7048292
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: sodium/hydrogen exchanger
Protein accession: YP_003390568
Protein GI: 284040638
COG category:
COG ID:
TIGRFAM ID:
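
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of that arithmetic follows. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(7047036, 7048292)
print(length, protein_length_aa(length))  # prints: 1257 418
```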


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0577051
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.596223
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCTAT TTACCCTGAT TACATTACTT ATTGTTGCTT CAGCGTTACT TGCCTACCTA 
AATACAAGGT GGCTCAAACT ACCAGACGCC ATTGGTATTA TGGTGCTGTC ACTATGTTTT
TCGCTGTTGC TAATCGGAAT GAACGCCATT TATCCGGCCA GCCTTACACT GGTTCGGCAG
ACAGTTCGTG AAATCGATTT TGGAAAAGCT CTTTTCGACG TCATGTTAAG CTTCCTGCTT
TTTGCGGGGG CCTTTCATAC CGATGCAGCC AAATTACGGG TCGAACGGCG GTCGGTCATG
CTATTTGCCT TCGTGGGTGT ACTGCTCAGC ACGGTGTTGG TTGGCACGGG TGTTTATTAC
GTCACTAAAT TCCTGGGTAT AAATTTACCC TTCACTCTGT GTCTTTTATT TGGTGCGCTT
ATTTCCCCTA CCGATCCAAT TGCCGTATTA GGCATTCTGG CCAAGTTCAA ACTCCCGGAG
AGTGTTAAGC TCAACATTGT GGGTGAATCA CTATTTAATG ACGGGGTCGG CGTCGTTGTC
TTTGCCTCCA TTTATCAGAT TGTTCGCAAT GGTGCCGAGA GTATAAGCGC CGGAGAAATT
ATCTGGTTAT TCGTCGAAGA AGCTGGTGGG GGCGTCATTT TCGGAATAGC GCTGGGCTAT
CTGATGTACT GGCTCCTTCG CTCCATAAAC CACTACCAAA CCGAAGTAAT GATTACCGTA
GCCGCCGTAA TGGGTGGCTA CCTGCTGGCC CAGAGCCTGC ACATCTCGGG GCCATTGGCG
ATGGTGGTGG CGGGTTTGTT TGTGGGCGAT AACAACGCCC GTCAGGATGC CATGAGCCAG
ACAACGAAGC GTTATGTTGA CGGCTTCTGG GAATTGATCG ACAGCATTCT GAATGCCCTG
CTCTTTGTGC TGATCGGTAT CGAACTACTG GTGATCGACT TTCACATTTC GTACTGGACC
ATCTATCTGG TTGTCATCCT GATTGTCCTG CTGTCGCGGT ACCTGGCTAT TCTGATTCCG
TTTACACTAG CCCGTCGCTG GCTCGACCTC GATTCAAAAG CGCCCCTCAT GCTCACCTGG
GGCGGCTTAC GGGGTGGGCT TTCGATTGCT ATGGCTTTAT CGATCGATAA CGACTTACCC
CACAAAGACT TTATTGTAAG CGTCACCTAC GCGGTAGTTC TGTTTTCTGT AATCGGGCAG
GGGCTAACGA TGGAACGGCT CATCCGGCGG CTGTATCCAC CCGAAGAACA AGTCTGA
 
Protein sequence
MDLFTLITLL IVASALLAYL NTRWLKLPDA IGIMVLSLCF SLLLIGMNAI YPASLTLVRQ 
TVREIDFGKA LFDVMLSFLL FAGAFHTDAA KLRVERRSVM LFAFVGVLLS TVLVGTGVYY
VTKFLGINLP FTLCLLFGAL ISPTDPIAVL GILAKFKLPE SVKLNIVGES LFNDGVGVVV
FASIYQIVRN GAESISAGEI IWLFVEEAGG GVIFGIALGY LMYWLLRSIN HYQTEVMITV
AAVMGGYLLA QSLHISGPLA MVVAGLFVGD NNARQDAMSQ TTKRYVDGFW ELIDSILNAL
LFVLIGIELL VIDFHISYWT IYLVVILIVL LSRYLAILIP FTLARRWLDL DSKAPLMLTW
GGLRGGLSIA MALSIDNDLP HKDFIVSVTY AVVLFSVIGQ GLTMERLIRR LYPPEEQV
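
The gene and protein sequences above can be cross-checked by translating the DNA and comparing the result with the listed residues. The sketch below is one way to do that in plain Python, along with a GC-content helper. It assumes the sequence text is pasted in with its spacing intact. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which start codons are permitted, so the standard assignments are used here.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence as listed in the record.
first_line = "ATGGATCTAT TTACCCTGAT TACATTACTT ATTGTTGCTT CAGCGTTACT TGCCTACCTA"
print(translate(first_line))  # prints: MDLFTLITLLIVASALLAYL
```

The printed residues match the first 20 residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_percent` allows the same comparison against the complete protein sequence and the reported GC content.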