Gene Slin_3947 details

Gene Information

Locus tag: Slin_3947
Symbol:
ID: 8727705
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4733002
End bp: 4734546
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003388736
Protein GI: 284038806
COG category:
COG ID:
TIGRFAM ID:
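
The Start bp, End bp, Gene Length, and Protein Length fields in this record are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that the coordinates are 1-based and inclusive and that the CDS length includes the stop codon; the function names are illustrative and are not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4733002, 4734546)
print(length)                  # 1545
print(protein_length(length))  # 514
```

Both values match the Gene Length (1545 bp) and Protein Length (514 aa) fields listed in the record.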


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACA GTATCAGTCG ATTTCTTTTA TTCCTTCTCC CCCTTTGTGG GTTAGGGGGC 
CTTTCGGGCT GTGATGAAGG GTTTACCGAA CTGAATACCA ACCGTGTGAA CCCCACAGCG
CTGGCCCCTT CGCTGGTGCT TAACAAAGCC ATAATCAGCA CCACCTACCT CGATGGCTTT
GGCACACTGG GGATGCTGAC ATACAACTTT GGCATTGTGC AGCAGATCAT TACACCCTAT
GGCAGCTCGC TGTCGGGGGC TAATTACGAT CAGGTCAACG GCAGCAACAC GCCCCTGGTG
TGGGTTAATT TCTACCGTAA TGTGCTCAAG CAGTTAGTTG CCGTACTGGA CCAGACCAAG
AGTGATCCCC TACAAGTCAA CACGTATAAC GCGGCCCGCA TCTGGAAAGC CTACATCTTT
ATGATCCTGA CCGACACCTA TGGCGATGTG CCGTATTTCG AAGCCGGACA GGGCTATACG
AATGAGATCA TCACGCCCAA ATACGATGCG CAGCAGGCTA TCTACAAAGA TATTCTGAAG
GAACTCGACG AAGCATCGGC CGCGCTGACG ACCACCCAGG CCGCCGTAAC CACCGATATT
CTGTACGGGG GCGACGTTGC CAAATGGAAA AAGCTCGGCT ATTCGTTTAT GCTGCGGGCC
GCCATGCGCC TGACCAAAGT TGACCCGGCC ATGGCACAAT CGTACGTAGC CAAAGCTGTA
GCAGGTGGCG TATTCCAGTC CAACGCCGAC AACTCGATTA TCCGGCATAC GGCCATTTAC
AACAACTACA TCGCTAATCA CCTGGCCGCC CGCGAAAAAA CCAACTTCTA CCTCGCAGCC
CCCTTTGTGA ACTACCTGAA GGAGAACAAC GACCCGCGGC TGCCCATTTT CGCCGTGCGT
TACGTAGGTG CCAAAGGTGG TCAGGAACAA ATACCAGCCC GTGCCTCCTC CGACCCGAAA
GTACAGATCG GCATGCCGAT GGGCTACAAT GATGTATCCA TTACAACGAC ACTGGCCCAA
AACGGAGTGG CCAGCCTGTG GGATTACAGC CAGGTGAACC TGAACACGGT GCTCAAGCTG
GATGCACCCG AGTTTCACAT CACCTACTCG CAGGTCCAAC TGCTGTTGGC CGAAGCCGCC
GTTCGGGGCT GGGTGACGGG TGCTGCGGCC GACTACTACG CCAGAGGTAT CCGGGCCAAT
CTGGAACAAA TGGCTTCCTA TGGATCGGCC GTTTCGGAAG CCGATATAAA AGCGTATCTG
GACACTCACC CACTTGATGC GGCCAAAGCG CTGGAGCTGA TCAATACGCA ATACTGGGTG
GCGACCTTCC TGGATGGGAA CGAGTCCTTC GCCAATTTCC GGCGCAGCGG CTTCCCCACC
CTGAAGAAAA ACCCATACCC TGGCTCCGAA GTGAAAGGCG ACTTCATCCG GCGGCTACCT
TACCCGGATA GCGAAATCGT TGTCAACTCC GGCAGTTTGA ACGAAGCCAT CGCCCGGCAG
GGTCCCAACA CGCTCGATAC GCGGGTGTGG TGGGATAAGA AGTAA
 
Protein sequence
MKNSISRFLL FLLPLCGLGG LSGCDEGFTE LNTNRVNPTA LAPSLVLNKA IISTTYLDGF 
GTLGMLTYNF GIVQQIITPY GSSLSGANYD QVNGSNTPLV WVNFYRNVLK QLVAVLDQTK
SDPLQVNTYN AARIWKAYIF MILTDTYGDV PYFEAGQGYT NEIITPKYDA QQAIYKDILK
ELDEASAALT TTQAAVTTDI LYGGDVAKWK KLGYSFMLRA AMRLTKVDPA MAQSYVAKAV
AGGVFQSNAD NSIIRHTAIY NNYIANHLAA REKTNFYLAA PFVNYLKENN DPRLPIFAVR
YVGAKGGQEQ IPARASSDPK VQIGMPMGYN DVSITTTLAQ NGVASLWDYS QVNLNTVLKL
DAPEFHITYS QVQLLLAEAA VRGWVTGAAA DYYARGIRAN LEQMASYGSA VSEADIKAYL
DTHPLDAAKA LELINTQYWV ATFLDGNESF ANFRRSGFPT LKKNPYPGSE VKGDFIRRLP
YPDSEIVVNS GSLNEAIARQ GPNTLDTRVW WDKK