Gene Slin_1953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1953 
Symbol 
ID8725690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2363715 
End bp2364932 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content45% 
IMG OID 
Productintegrase family protein 
Protein accessionYP_003386797 
Protein GI284036867 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.247559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.500213 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCTT CGATCAAGAT TGTCCTGCGT AAGAAACCCA ATCAGGATGG CACGTTTCCC 
ATTGCCATTC GTATAACCAA AGACCGCAAG TCCACGTATA CCTATCTAGG ATACAGTGTG
ACCGAAGCCC AATGGGATGC CAAAGAGCAT AAAGTTAAAA AAAACCATCC GAATTCAACC
CGGCTGAATA ACCTCATTGA GAACAAAAAA ACGGAACTCA GTGATAAACT GATCGAACTG
CAAGTCCAGC AGAAAGACAC ATCGGTAAGC GCCATTCGCA AGCAGATCAA ACCGAAACAA
CATACATCCT TTTTTACGCA GGCAGGTGCG TACATTGAGA ATATGCGTAA AGAGGGCAAG
TATAACCGCG TACTGACAGA AGAGGCCCGT ATCAAACATT TTAAATCCTT TCTGGATGAA
GGTGACATTA CGTTCCCAGA GATTGATGTG CCGCTGCTCA ACCGGTTCCG TGCATATTTG
AAAAGCGAGC GGAAAGTCAG TGAACGAACG ATCATCAACC ATCTGATTCT AATCCGCACC
GTTTATAATC AGGCGATTGC CGGTGATATT GTTGACCGAA AATATTATCC GTTTGGCAAA
GGGAAAATTG GTATCAAATT TCCAGACTCG ATTAAGCTGG GACTGGTGCC AAAAGAAGTG
AGAGCACTGG AAACCGTTGA TCTGTCCGAC AACACTTATT ACAACCATGC CCGAAACCTC
TGGCTCATGG CATTCTACTT CGCCGGGATG CGTGTATCTG ATGTGCTGCG GCTTAAATGG
TCGGATTTTC AGGATGACCG GCTCCATTAT ACAATGGGCA AGAACAAAAA GACGGGGTCA
TTAAAAACAC CGGACAAGGT CATTGGCATT CTCAGCCAAT ATCGTCAAGA CCCCAAAAAA
CATAACTTGA TTTTTCCTGA ACTCAAAGTA CTGGATGATC TGGATGATAC GTATCATGTG
CAACGTAAAA TTTCTTATGC TGTCAAGCGC CTGAATACGG CATTGGAGGA GGTGGCCAAA
CGTGCCGAAA TTACCAAGCC TGTGACGATG CACATTGCCC GGCATACTTT CGGCAACATT
TCAGGCGATA AAATCCCTAT TCAGAGACTA CAGGAATTGT ACCGGCATTC CAGCATTACC
ACGACCATTG GCTACCAGAG CAGCTTTATT AACAAGACAG CAGACGATGC ATTAGACGCG
GTTTTGAACA TGGAGTAG
 
Protein sequence
MASSIKIVLR KKPNQDGTFP IAIRITKDRK STYTYLGYSV TEAQWDAKEH KVKKNHPNST 
RLNNLIENKK TELSDKLIEL QVQQKDTSVS AIRKQIKPKQ HTSFFTQAGA YIENMRKEGK
YNRVLTEEAR IKHFKSFLDE GDITFPEIDV PLLNRFRAYL KSERKVSERT IINHLILIRT
VYNQAIAGDI VDRKYYPFGK GKIGIKFPDS IKLGLVPKEV RALETVDLSD NTYYNHARNL
WLMAFYFAGM RVSDVLRLKW SDFQDDRLHY TMGKNKKTGS LKTPDKVIGI LSQYRQDPKK
HNLIFPELKV LDDLDDTYHV QRKISYAVKR LNTALEEVAK RAEITKPVTM HIARHTFGNI
SGDKIPIQRL QELYRHSSIT TTIGYQSSFI NKTADDALDA VLNME