Gene Slin_4369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4369 
Symbol 
ID8728129 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5298234 
End bp5299484 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content56% 
IMG OID 
Productglycosyl transferase group 1 
Protein accessionYP_003389149 
Protein GI284039219 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00420943 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.404826 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACATCA GCATTCTGGG CTCTATTGCA ACAAACGACC TGCTAACGGA AGCCGCCCGG 
CAACAACTCG GTTCGTACCC GAAAGGGCAG CCGGGTGCTC CGTTGATCAG TAATCTCATT
AACGAATACC TGTCGCGGGG GTATAAGGTG CTGGCCATTA CCATGGACGA TCAGCTCTCA
GACGATGAAC CGCCGTTTGT GTATACAGAC CAGTTGCTGA CGTATGTTAT TGTTCCGAAA
CGTAAACATA CCTTCCGCCC CAACGGCAGG CGCCCCGGCC GCACCGCCGA CTTCTTTCGC
TTTGAGCGGA ACCAGATGGT GGCTGTGCTG AAGCAGTACA AACCCGATGT GGTACATGCC
CACTGGACGT ACGAGTACGC GCTGGCCGGA CTGTCGTACA ACCCCAACAC ACTCATCACC
GTACACGACA ACGCCCGGAT AATATTCGGC TATGTCCGTA CGCTGAATCG CTTTTTTCAT
TTGCTGCTCG CCCGGTACGT GTTTCAGCGG GGACGCTGGT TTACGGCGGT ATCGCCCTAC
ATGGCCGGAA CCGTACAGCC ATGGATAGCC GAGCCGGTAG CTGTAGTGCC GAACCCGGTA
CCGATGCCCA AAAAAAACCG GGACTCAACC CGGTCGAACG TGCCTGTAAT CAGCATGGTG
GTGAACGGCT GGGACGACCG AAAAAACAGC AGGAATGCCC TACTGGCGTT TAAAGGTATT
CAGCAGCGGC ACCCAAATGC GGTCCTGTGG GCTTTTGGTA CGGCTTTCGA ACCCGGCGAA
CATGCCGACG CTTTCTGCCG GGAACATCAG ATTCCCAATG TGGTACTGCA TGGTTCAACG
CGCTACGCAG ACGTACTCGA TAAAGTTTCC CAGAGTACCG TTCTGCTTCA TGCCTCCCTC
GAAGAGTCGT TTGGTATGGT ACTGGCCGAA GCCATGAGCT TTGGCGTGCC CGTTGTGGCC
GGGAAAGACA GCGGGGCGGT GGCTTGGGTG GTGGAAGATG GCGGCCTGCT GGTTGACGTC
ACGAAGGTAA ATGAGATGGT GGAGGCCGTC GACAAACTTC TGTCCGACCC GGTACTCTAC
AAACGGTGCT CGGCCAATGC GGTCCGGGTG GTGCAAACGC GCTTCCCGAT TGAGGAGGTC
GCCGATCAGT ACGTGTCGCT TTACAAAAAG CACGGCATCA AACCGGAAAC AGCCATTACA
AAGGGTAAGC CCCGTCAGCA ACTGAGCGTT GATCCGGCTT ACAAAGGGTA A
 
Protein sequence
MYISILGSIA TNDLLTEAAR QQLGSYPKGQ PGAPLISNLI NEYLSRGYKV LAITMDDQLS 
DDEPPFVYTD QLLTYVIVPK RKHTFRPNGR RPGRTADFFR FERNQMVAVL KQYKPDVVHA
HWTYEYALAG LSYNPNTLIT VHDNARIIFG YVRTLNRFFH LLLARYVFQR GRWFTAVSPY
MAGTVQPWIA EPVAVVPNPV PMPKKNRDST RSNVPVISMV VNGWDDRKNS RNALLAFKGI
QQRHPNAVLW AFGTAFEPGE HADAFCREHQ IPNVVLHGST RYADVLDKVS QSTVLLHASL
EESFGMVLAE AMSFGVPVVA GKDSGAVAWV VEDGGLLVDV TKVNEMVEAV DKLLSDPVLY
KRCSANAVRV VQTRFPIEEV ADQYVSLYKK HGIKPETAIT KGKPRQQLSV DPAYKG