Gene Slin_5036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5036 
Symbol 
ID8728801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6139880 
End bp6141166 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content50% 
IMG OID 
Productcitrate synthase I 
Protein accessionYP_003389812 
Protein GI284039882 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAACT CCGCTGAATT AACCGTCGAT GGTAAAACAT ATTCATTTCC AACCTTAGAG 
GGAACCGAAC ATGAAAAAGC CTTCGACATC TCGAACCTTC GTGATCAGAC TGGCTACGTT
ACCTTAGATC GTGGTTATAA AAACACCGGT GCCACCAAAA GTGCCATCAC ATTTCTGGAT
GGTGAGCTGG GCATTCTGCA ATACCGAGGG TATTCAATTG AAGATTTAGC CGCCAAGGCG
TCGTTTCTGG AAGTTGCCTA TTTGTTGATC TATGGTGAAC TGCCTACGCA GGAAGAGTAC
CATACCTTCG AAAATGCCAT TCGTCGGCAT ACGCTGGTGA ACGAAGGCAT GCGGACAATC
TTTAACGGGT TCCCGGTCAA TGCGCACCCA ATGGGTGTAC TGGCGTCGAT GGTTAGCGCC
ATGAGTGCTT TCTACCCTGA GTTGGAGAAT GGCAAAGAAG AAGACAAAAC CGACCTGCAC
ATTATCCGGT TGCTGGCCAA GCTGCCAACC ATTGCCACCT GGTCGTACAA GCGCTCGATG
GGCCATCCGA CCAACTACCC GAAGAATAAC CTCGACTACA TCCCGAACTT CCTGAATATG
ATGTTCGCGC TGCCCGTCGA AGACTATAAG GTCGATCCGG TCGTTGCCGA AGCCCTAAAC
GTACTGCTTA TTCTTCATGC CGACCACGAG CAGAACTGCT CCACATCAAC GGTACGTCTG
GTTGGATCGT CGCAGGCTAA CCTGTACTCG TCAATTTCGG CGGGCATTAG TGCCTTATGG
GGTCCGCTGC ATGGTGGTGC AAACCAGGAA GTGATTGAAA TGCTGGAAAA TATTAAAGCC
GATGGTGGCG ATGTTTCCAA GTATGTAGAA ATGGCCAAGA ACGCCAAAAC GACGGGCTTC
CGTTTGTTCG GGTTCGGTCA CCGGGTTTAC AAAAACTTCG ATCCCCGCGC TAAAATTATC
AAGAAAGCTG CTGATGATGT ATTAGCTAAG CTGGGCGTAA ACGACCCCGT TCTTGAAATC
GCCAAAGGTC TTGAAGAGGC CGCGTTGAAC GATGAATACT TCGTATCGCG CAAATTGTAC
CCGAATGTGG ATTTCTACTC GGGTATAATC TACCGCGCGC TGGGTATCCC AACGAATATG
TTTACGGTCA TGTTCGCTAT CGGCCGCCTA CCGGGTTGGA TTGCCCAATG GAAAGAAATG
CGCGAAACGA AAGAGCCGAT CGGTCGGCCT CGTCAGATTT ATACGGGAGC TACCCTACGG
GAGTTTGTTC CGCTGGAGAA CCGGTAA
 
Protein sequence
MANSAELTVD GKTYSFPTLE GTEHEKAFDI SNLRDQTGYV TLDRGYKNTG ATKSAITFLD 
GELGILQYRG YSIEDLAAKA SFLEVAYLLI YGELPTQEEY HTFENAIRRH TLVNEGMRTI
FNGFPVNAHP MGVLASMVSA MSAFYPELEN GKEEDKTDLH IIRLLAKLPT IATWSYKRSM
GHPTNYPKNN LDYIPNFLNM MFALPVEDYK VDPVVAEALN VLLILHADHE QNCSTSTVRL
VGSSQANLYS SISAGISALW GPLHGGANQE VIEMLENIKA DGGDVSKYVE MAKNAKTTGF
RLFGFGHRVY KNFDPRAKII KKAADDVLAK LGVNDPVLEI AKGLEEAALN DEYFVSRKLY
PNVDFYSGII YRALGIPTNM FTVMFAIGRL PGWIAQWKEM RETKEPIGRP RQIYTGATLR
EFVPLENR