Gene Slin_4080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4080 
Symbol 
ID8727838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4909640 
End bp4910845 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content51% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003388866 
Protein GI284038936 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.606735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.10978 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTACTG CCCTGCTGCT ATTCTGGCTA TCTATAGTCG GTATTCAGCT TATCTATATC 
TTCTTCGTTT ACACCAAAAC GGCTTTTTAT CGCCATCCAG ACCGAAACTA TGCCGTTTTA
TCCTACAATT CTGCCGACTA TTATTCCGAA TCAGACGATC AGCAGGGCGT AACGGTCATC
GTCTGTGCCC GTAATGAGTT GGCCAACCTT AAGGAGCTGC TGCCCCTGCT GAACAACCAG
GACTATCCTA CGTTCGAGAT ACTCGTCATG GACGACCGCT CGACCGATGG CACCTACGCG
TATCTGGAAA ACGACATCCC TGAACTAAGC CGGGTTCGTG CTATCCGTAT CGATAAGGAG
CACCAGCACG TAACGCCCAA AAAGTACGCC CTCACCATTG CCATCAAAAA AGCCCGCTAC
CCTACTGTTC TGCTTACAGA TGCAGACTGT CGCCCGGCTT CGCTGAACTG GCTTACCGAA
ATGACCGAGC CATTGATTTT CGGTTCCAAA GACATTACGA TTGGGTTTTC GCCTTATGAA
TATTACCCGG GGTTGCTCAA CCTGCTGATC CGCTCAGAAA CCCTGTTTAC GGCTATTCAG
TATTTTTCAC TGGCTCTGTC GGGGCGGCCC TATATGGGCG TTGGCCGGAA TATGGCCTAC
CGGACCGACC TGTTTTTTGC GAATAAAGGC TTTTATACGC ACATGAATGT GGTTGGTGGC
GACGATGATC TGTTCATCAA CGAAGTCGCT ACCCGCTCTA ACACGTTCGT TTGTCTGGCA
CCCGATACAT TTGTATGGTC GAAACCCAAG ACAACCTGGG CCGAATGGCG GCAGCAGAAA
CGACGCCACC TTAACGTGGG CAAGTACTAC AAAACAGGGA ATAAAGTGCG GCTGGGGCTG
CTCACCGGCT CGCATGTGCT AAGTTGGGCT ATGGCCCTGG TAGTGGGTTT GCTGGTAGTC
GTTCATGCGC TTCACTGGTA TTCGTTTTCC AGCGACGAGT GGTTACTTTT GCTGGTCAGT
ACAGGGGCTT TTATTCTCCG GCAACTTGCC TTCTGGGTGA TTGTCGGACG AATCAGCCAC
CGACTGGCCC ACACCGTTCA CTGGTCCTTC ATACCGTTTA TGGACCTGCT GATGGCCGTT
TATTACGGAC TGGCTGGTCT GAAAACGCTG TTTAACCGTC GCAAAAAACA AATTTACTGG
CGATAG
 
Protein sequence
MITALLLFWL SIVGIQLIYI FFVYTKTAFY RHPDRNYAVL SYNSADYYSE SDDQQGVTVI 
VCARNELANL KELLPLLNNQ DYPTFEILVM DDRSTDGTYA YLENDIPELS RVRAIRIDKE
HQHVTPKKYA LTIAIKKARY PTVLLTDADC RPASLNWLTE MTEPLIFGSK DITIGFSPYE
YYPGLLNLLI RSETLFTAIQ YFSLALSGRP YMGVGRNMAY RTDLFFANKG FYTHMNVVGG
DDDLFINEVA TRSNTFVCLA PDTFVWSKPK TTWAEWRQQK RRHLNVGKYY KTGNKVRLGL
LTGSHVLSWA MALVVGLLVV VHALHWYSFS SDEWLLLLVS TGAFILRQLA FWVIVGRISH
RLAHTVHWSF IPFMDLLMAV YYGLAGLKTL FNRRKKQIYW R