Gene Slin_3853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3853 
Symbol 
ID8727611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4624016 
End bp4625206 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content52% 
IMG OID 
Productglycosyl transferase family 2 
Protein accessionYP_003388642 
Protein GI284038712 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTATTG CCCTTAGCTT ATTAACTCTA CTTTTCTTTT TGACGGCTGT TGAAACGATC 
TACCTGCTCG TTTTCACGAT GGCCGGCCGG TTCGGACACC TGCAAGCCCC TCCGGTAAAT
CCAAATCCTG TCGCCAAGCG GATGGCCGTG CTGATTCCGG CCTATAAGGA AGATGCCGTT
ATTACCGACA CAGCCGAACA GGCCCTCAAA CAGGATTATC CGCGTGATGC CTACGATGTT
ATTGTCATTG CCGATTCACT CCAGCAAGAT ACGCTCGACA AACTAGCCGC CCTGCCTATT
ACGGTTGTTG AGGTAAGTTT TGACGTTTCG ACGAAAGCCA AGGCGCTGAA TGCGACCATG
AGTAAGCTCA GTGCCGCGTA CGATGTGGCC GTCATTCTGG ACGCCGACAA TGTGATGGCC
ACCAACTTCC TGACCCATAT CAATGCCGCA TTCAACGGCG GCTGGAAAGC GGTACAAGGC
CACCGGGTAG CTAAAAACAC CAATACCAGC GTGGCTATTC TGGATGCCGT CAGTGAAGAA
ATCAACACCT ACATACTCCG TCGTGGCCAC CGGGCCCTGG GGCTGTCGGG TAGTTTGATG
GGCTCCGGTA TGGCGTTTGA CTATACCTTA TTCAAGCAGT ATATGGGTCA GATTAACACT
ACAGGCGGAT TCGATAAAGA ACTGGAAATG CGGCTCATCC ACGACCACCA CCGGATCGAC
TACATCGATG AGGCCTTGTG TTACGACGAG AAAGTACAAA GCGGGGCCGT ATTTGAACGG
CAACGGGCCC GGTGGATTGC CGCTCAGCTC AAGTACCTGC GCCGTAACCT GCCCTCGGGC
ATCGTTCAGT TACTGAAAGG CAATCTGGAT TACTTCGATA AGGTTTTCCA GACCATGTTT
TTACCCCGGG TAATTCTGCT GGGCTTCCTC ACCATTGGTA CGGGTGCAGC GCTTGTCTTG
CAGGATTCCA ACCTGCTTAT GCTCGCGGGA GGGCAGCTCC TGATTCTCTT ACTTACCTTT
TACATCGCCA CGCCGAATGA GCTGCTGGCC CTGATCACCT GGAAAGAAAT AAGCCAGATT
CCCGGTTTGT TCTTCCGCTT TTTACGGTCG ATAACCCGCC TGGGCGAAGC AAGCAAGAAG
TTTATCAATA CCCCTCATTC AACCACAAGT ACAGCCATAA ACGAAGGATG A
 
Protein sequence
MTIALSLLTL LFFLTAVETI YLLVFTMAGR FGHLQAPPVN PNPVAKRMAV LIPAYKEDAV 
ITDTAEQALK QDYPRDAYDV IVIADSLQQD TLDKLAALPI TVVEVSFDVS TKAKALNATM
SKLSAAYDVA VILDADNVMA TNFLTHINAA FNGGWKAVQG HRVAKNTNTS VAILDAVSEE
INTYILRRGH RALGLSGSLM GSGMAFDYTL FKQYMGQINT TGGFDKELEM RLIHDHHRID
YIDEALCYDE KVQSGAVFER QRARWIAAQL KYLRRNLPSG IVQLLKGNLD YFDKVFQTMF
LPRVILLGFL TIGTGAALVL QDSNLLMLAG GQLLILLLTF YIATPNELLA LITWKEISQI
PGLFFRFLRS ITRLGEASKK FINTPHSTTS TAINEG