Gene Slin_5020 details


Gene Information

Locus tag: Slin_5020
Symbol:
ID: 8728785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6117397
End bp: 6118653
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: protein of unknown function DUF214
Protein accession: YP_003389796
Protein GI: 284039866
COG category:
COG ID:
TIGRFAM ID:
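
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon (the listed gene sequence ends in TAA). The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(6117397, 6118653)
print(length)                     # 1257
print(protein_length_aa(length))  # 418
```

Under these assumptions both values agree with the Gene Length and Protein Length fields.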


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCTGT TCCGAATTAG TTGGAGTAAC CTGAAAGACA AACCCCTGAG CAGTTTTTTA 
AGCGGACTGC TCATGACGTT CGGTATTACG ATTATTTCGT TGCTGCTGTT GCTCAATAAG
CAATTGGATG ATCAGTTTCG CAAAAACATA AAAGGGATCG ACATGGTGCT TGGTGCTAAA
GGAAGCCCGC TGCAGCTGAT TCTGGCGAGC ATCTACCAGA TTGATTCGCC AACGGGTAAT
ATTCCACTCG ACGAAGCCGA ACGCCTGACC CGCAACCCGA TGATCAAAAC GGCGATTCCC
CTGTCGATGG GCGATAATTA CCAGTCATTT CGGATTGTGG GTACCAACAA AAAATACCTC
GATCATTTTG GTGCTACCGT TGCTCAGGGG AAGCTTTTTG ACAAAGCCCT GGAAACCGTA
ATTGGTCCAC GGGTGGCGGC CGTTACCGGC TTAAAACTTG GCGATACGTT CTCTGGCTCA
CACGGCCTCG ACAAAGACGG CGACGTACAC GCCGATACCA AATATAAGGT GGTTGGTATC
CTGAGCCCCA CCAATACCGT TGCCGACCAG CTGATTGTTA CGCCATTGTC CAGCGTTTGG
GCCATTCATG AGCACCATGA GGAACACGAA GAGGGACACC ATGACGAAGA AACGCAGCCA
GCCGGACCAA CGCTTGGCGG CCCGGCGCCC GATGTAGCAG AAGAACCCGG AGAGCCCCGG
GAGATTACCA GTATGCTTAT CAAGTTTCGG AACCCGTTGG GGATGATGCT GGCGCGGGGC
ATTAACAGCA ACTCCAAACT TCAGGCGGCA TTACCCAATA TTGAGATAAA TCGCCTGTTT
TCATTGCTTG GTGTGGGTGT TGAAACACTG CGGGGCCTGG CTATCGTCAT CATGCTGATT
TCCGGCATCA GCGTGTTTGT TTCGTTGTAT AACTCATTGA AGGAACGACG CTATGAAATG
GCTCTGATGC TGTCGATGGG CGCTACCCGT GCACAGCTTT TCGGTATGTT GCTCCTCGAA
GGACTGGTGC TGGCACTGAT CGGCTTCATC CTCGGCATAC TTCTCAGTCG CGTTGGCTTG
TGGTTATTTT CCAGCAGTGT ATCGTCGGAA TACCATTATA ATCTGGCCGC ATTCGGTATT
CTGCCCGAAG AGTGGGTTTT GCTCGGCGTT GCCATTCTGA TTGGCCTGCT GGCCGCTGCC
CTACCCGCTC TGGGCGTCTA CCGCATGAAC ATCTCCAGAA CGCTGGCTGA AGAATAA
 
Protein sequence
MNLFRISWSN LKDKPLSSFL SGLLMTFGIT IISLLLLLNK QLDDQFRKNI KGIDMVLGAK 
GSPLQLILAS IYQIDSPTGN IPLDEAERLT RNPMIKTAIP LSMGDNYQSF RIVGTNKKYL
DHFGATVAQG KLFDKALETV IGPRVAAVTG LKLGDTFSGS HGLDKDGDVH ADTKYKVVGI
LSPTNTVADQ LIVTPLSSVW AIHEHHEEHE EGHHDEETQP AGPTLGGPAP DVAEEPGEPR
EITSMLIKFR NPLGMMLARG INSNSKLQAA LPNIEINRLF SLLGVGVETL RGLAIVIMLI
SGISVFVSLY NSLKERRYEM ALMLSMGATR AQLFGMLLLE GLVLALIGFI LGILLSRVGL
WLFSSSVSSE YHYNLAAFGI LPEEWVLLGV AILIGLLAAA LPALGVYRMN ISRTLAEE
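
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be cleaned before any calculation. The following is a minimal sketch, not part of the original record, of how a value like the GC content field could be recomputed from the gene sequence. The function names are hypothetical, and the rounding used by the source database is not known. The usage lines run on only the first two displayed blocks of the gene sequence.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence, as an illustration only.
first_blocks = "ATGAATCTGT TCCGAATTAG"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 35.0
```

Passing the full gene sequence to `gc_percent` gives a figure that can be compared with the GC content field.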