Gene Slin_3145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3145 
Symbol 
ID8726898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3815144 
End bp3816352 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content48% 
IMG OID 
ProductProtein of unknown function DUF1972 
Protein accessionYP_003387955 
Protein GI284038025 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.281448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAGA CCAACGTAGC AATTATTGGA ACTGTAGGTA TCCCGGCCAA ATATGGCGGT 
TTCGAAACCC TAGCCGAGCA TCTGGTGGAT CACCTGGGTA ACGAAACAAA CATAACGGTC
TACTGTACGA CCAAGAAGTA TGGTCAGTCT GAGCGGCCGG CGACGTATAA AGGGGCACGG
TTGGTATACC TGCCTTTCGA CGCTAACGGT ATTCAGAGTA TCATTTATGA CTGCTTCAGT
ATTTTACATG CATTATTTTA TGCCGACGTA ATGCTCATTC TGGGTGTCAG CGGGGGCTTT
ATGGTGCCCT TTGTCCGGTG GTTTACCAAG AAAAAAATTA TCATCTCGAT TGACGGTATT
GAGTGGAAAC GGAACAAATG GAGCAAACTT GCCCGCTGGT ATCTGTGGGC GGCCGAATGG
GTAGCTGTAC GTTATTCACA CGCGGATATT TCGGACAATG AGTCGATTCA GAATTACACC
GCCATTCGCT ACAAGACGCT GAGCCATATT ATTGAGTATG GCGCCGACCA CACCATCGCG
GTTAGCCCAA CGCCTGCCGA CCGCGAAACG TATCCATTTT TGGCTGCTCC CTACGCGTTT
ACCGTTTGCC GAATCGAGCC GGAAAACAAC ATTCATCTGA TATTGGAGGC TTTTGCCCAA
TTACCGAAGC ATACGCTGGT TATGGTAGGA AACTGGACAA ACAGTGAGTA CGGAGCCAGT
CTGCGCGAAC AACATAAAAA AGATACGAAT ATTCATTTAC TCGACCCTAT TTATGACCAG
CGTCAGCTTG ATTTGCTGCG CAGCAATTGC CTGATTTATG TTCATGGGCA CAGTGCCGGT
GGAACAAACC CATCCCTGGT AGAAGCCATG TACCTCGGTT TACCGGTTAT TGCGTTCGAT
GTAGCCTACA ACCGGTCGAC TACGGAAAAC AAGGCTCTGT TTTTTAGAAC ATCGGCCGAA
TTGACCAAGC ATATTCAGAA CACCTCGATC AGCGAACTGA AGAACCGGGC CGATATCATG
AAGACCATTG CTTATCGCCG GTATACCTGG ACTGTGATTG CCCAGAAATA CGCTTACCTG
ATTCGCCTTG TACAGCAGGT ATCTACCAAA AAAGAACTGA ACTCCCTGGC CGGTACACGC
CTGTCTTCCG ATTGGCTGCT GGAGTCGGAA TTGGCTCACT TAGAGAGACC ATCTTACTTT
TACGAATAA
 
Protein sequence
MSKTNVAIIG TVGIPAKYGG FETLAEHLVD HLGNETNITV YCTTKKYGQS ERPATYKGAR 
LVYLPFDANG IQSIIYDCFS ILHALFYADV MLILGVSGGF MVPFVRWFTK KKIIISIDGI
EWKRNKWSKL ARWYLWAAEW VAVRYSHADI SDNESIQNYT AIRYKTLSHI IEYGADHTIA
VSPTPADRET YPFLAAPYAF TVCRIEPENN IHLILEAFAQ LPKHTLVMVG NWTNSEYGAS
LREQHKKDTN IHLLDPIYDQ RQLDLLRSNC LIYVHGHSAG GTNPSLVEAM YLGLPVIAFD
VAYNRSTTEN KALFFRTSAE LTKHIQNTSI SELKNRADIM KTIAYRRYTW TVIAQKYAYL
IRLVQQVSTK KELNSLAGTR LSSDWLLESE LAHLERPSYF YE