Gene Slin_4368 details

Gene Information

Locus tag: Slin_4368
Symbol:
ID: 8728128
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5296947
End bp: 5298209
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: glycosyl transferase group 1
Protein accession: YP_003389148
Protein GI: 284039218
COG category:
COG ID:
TIGRFAM ID:
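
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state but which matches the listed length. The function names are illustrative only and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Amino-acid count for a CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(5296947, 5298209)
print(length)                  # 1263
print(protein_length(length))  # 420
```

Both values agree with the Gene Length and Protein Length fields of the record.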


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0158935
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.395097
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAG TACTTATCTC TGCCTACTCC TGCATTCCCG ACCGAGGGTC GGAGGAGGGG 
AACGGCTGGT TCTACTCCTC GCTGGTCAGC CAGCAGGGGT ATCAGGTCTG GTGCCTGACC
AGAGACATTG GCAAAGCCGA AATCGAACAA AAACTACGAC AGTGCGACTA CCCTAACCTG
CATTTTGAAT TCGTCACGCT GCCCCGCTGG GCCAACAAAG CCGGTTCCTT AGGTTTGCTG
GGCATGTATT TTCATTACCT CTACTGGCAA TGGACCGCCT TGCAGTCAGC CCGAAAACTA
GCTAGGACGC ACCAGTTCGA CTTGGTGCAC CATGTGAGTT ATACCAGTCT GCAACTCGGC
AGCTACCTCT ATAAACTCGG CCTGCCCTTT ATCTACGGGC CGGTTGGTGG TGGTCAGGAG
GCCCCTGCCA ATATGCGCCA CTATTTCAAA TCATACTGGC TGAAAGAAAA GATGCGGTCG
TGGGTAAGCG ACCTTATGCT TCATTTCAAT CCCGGCTGTT ACCAGTCGGT TCGCCGGGCC
GACTACGTGC TGGCCTGGAA CGAAGACACC CGCCGGATGA TTGCCTCGAT GGGCCGAACC
CAGGGTGTTG AGAAAGAATT CGGTGGTGTT GGAGCCAGCT TCATCCCCTC GAAACCCATC
CACCGCCCGG CTCATGATTC TCTGGAACTG GTTTGGGTTG GGCGGCTGAT GCCCCGTAAA
GCACTTGAGC TTTCCCTGCA CGGCATGAGC AAAGTGGACC CTCGGCTGCC CATTCATCTA
ACCATTGTCG GCGATGGCGA AATGGGGCAG TATGTCCCTG AATACATGGC GAAATACAAC
CTCGACAAAC GGGTTACGTG GGTTGGTAAA GTCAATTATG AGCAGGTGAA GGAATACTAC
CGCAAGGCCG ATGCCTTTCT GTTTACAAGC CTGCGCGACA CTGGCCCCGC TCAGTTAATG
GAGGCCATGG GTTACTCATT ACCGGTGGTG ACGCTGAATT TACACGGCCA GGCCGAACTG
GTGGACGATT CGACGGGTAT CCGGGTGCCC GTTACAACCC CCGAAGCCGT GGCGCAGGGG
TTGGCCGAAG CAATTACCTG GATGTATGAC AATGAACAGA AGCGGATTGA TATGGGCTTC
AATGCCTTTC AGTTTGCCCA ACGACAACGG TGGGAGCTGA AAGTAGCCCA CGTTGTTCAC
CGATACTATA CCGCCCTGAT AGGCCAGGCC GCCAGCGTGG CAATGGCCTT GAATCAGGAG
TAA
 
Protein sequence
MKKVLISAYS CIPDRGSEEG NGWFYSSLVS QQGYQVWCLT RDIGKAEIEQ KLRQCDYPNL 
HFEFVTLPRW ANKAGSLGLL GMYFHYLYWQ WTALQSARKL ARTHQFDLVH HVSYTSLQLG
SYLYKLGLPF IYGPVGGGQE APANMRHYFK SYWLKEKMRS WVSDLMLHFN PGCYQSVRRA
DYVLAWNEDT RRMIASMGRT QGVEKEFGGV GASFIPSKPI HRPAHDSLEL VWVGRLMPRK
ALELSLHGMS KVDPRLPIHL TIVGDGEMGQ YVPEYMAKYN LDKRVTWVGK VNYEQVKEYY
RKADAFLFTS LRDTGPAQLM EAMGYSLPVV TLNLHGQAEL VDDSTGIRVP VTTPEAVAQG
LAEAITWMYD NEQKRIDMGF NAFQFAQRQR WELKVAHVVH RYYTALIGQA ASVAMALNQE
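
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be compared. The following is a minimal sketch, not the method used to produce this record: it removes the spacing, translates the gene sequence, and computes GC content. It hard-codes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; the gene here starts with ATG). All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments,
# shared by translation table 11 for internal codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 18 bases of the gene sequence, as printed in the record.
print(translate("ATGAAAAAAG TACTTATC"))  # MKKVLI
```

To check the whole record, pass the full Gene sequence text to `translate` and compare the result with `clean` applied to the Protein sequence; `gc_content` on the same text can be compared with the GC content field.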