Gene Slin_1238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1238 
Symbol 
ID8724971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1510503 
End bp1511993 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content55% 
IMG OID 
ProductL-arabinose isomerase 
Protein accessionYP_003386087 
Protein GI284036157 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.112661 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATC TGAAACAATT TGAAGTCTGG TTCGTCACGG GAAGCCAGCA TTTATATGGC 
GAGGAAACCC TTCGTCAGGT AGCTGACCAT TCGCAGATTA TTGCGCGGTC GTTCGACCAG
TCGCCAACCA TTCCGGTGCG TGTGGTTTAC AAGCCAGTAG TCAAAACGCC GGATGAAATC
TATGCCATTT GTCAGGAGGC CAACGTAGCC CCTAACTGTG TCGGCATTAT CACCTGGATG
CACACGTTTT CCCCAGCTAA AATGTGGATT CGTGGCTTGT CGGTGCTGAA AAAGCCCCTG
CTTCACCTGC ACACTCAGTT TAACCGCGAC ATTCCGTGGA GTTCCATCGA CATGGATTTC
ATGAACCTCA ACCAGTCGGC CCACGGCGAC CGGGAGTTTG GCTTCATCAT GTCGAGGATG
CGGCTGAACC GCAAAATCGT GGTTGGTTAT TGGGAGCAGG ACGATGTGCT GGCCAAAATT
GCCGACTGGA GCCGGGTTTC GGTAGCGGCT TACGAGCTAA AAACCATGAA AGTTGTGCGT
TTCGGTGATA ACATGCGGCA GGTAGCCGTA ACCGACGGCG ATAAAGTGGC AGCTGAGATC
ACCTTCGGCA TGTCGGTGAA TACGCACGGT ATTGGCGATC TGGTGGCCGT TATCAACCAG
ATTTCCGACG CGGAGATTGA CCAGTTGGTA ACTGAATATG CAGACAGCTA CACGCTCATG
GACTCGCTGC TGCGCGGTGG CGCTCAGCAC AGCTCGCTTC GGGATGCGGC CAAAATAGAA
CTGGGTTTGC GGGCCTTTCT GAAAGACGGC AACTTCACGG CCTACACGGA TACGTTTGAA
GACCTTCACG GTATGACGCA GCTACCCGGC ATTGCCTCCC AGCGACTGAT GGCCGATGGC
TACGGGTTTG GGGGCGAGGG CGACTGGAAA ACGGCAGCCA TGGTACGGAC CATGAAAGTG
ATGGCGTCGG GACTGCCGGG TGGCAACTCG TTCATGGAAG ATTATACCTA CCACTTCGAC
CCATCGAATC CGCTGGTATT GGGCTCCCAC ATGCTCGAAA TCTGCCCGTC CATTGCCGCT
GACAAACCAA CCTGCGAGAT TCACCCGCTC GGTATTGGCG GTAAAGCCGA TCCCGTCCGA
CTGGTGTTCA ATGCGCCCGC TGGTCCGGCC ATCAACGTGT CGCTGATCGA CATGGGCAAC
CGTTTCCGAA TTCTGGTCAA TGAAGTCGAA GCCATCGACG TACCGGAAGC CTTGCCCAAG
CTACCCGTTG CGCGGGCCAT CTGGAAACCT ATGCCCGACA TGCAAACGGG TTGTGCCGCC
TGGATTCTGG CGGGAGGAGC GCACCATACC GTGTACAGCC AAAACCTGAC CACGGATCAC
ATCGAGGATT TCGCCGATAT TTTCGGTGTA GAGCTGGTGG TCATCGACCG AAATACGAAC
CTGCGGCAGT TGAAAAACGA GCTGCGCTGG AGTGAGGTGT ATTATAAATA G
 
Protein sequence
MLDLKQFEVW FVTGSQHLYG EETLRQVADH SQIIARSFDQ SPTIPVRVVY KPVVKTPDEI 
YAICQEANVA PNCVGIITWM HTFSPAKMWI RGLSVLKKPL LHLHTQFNRD IPWSSIDMDF
MNLNQSAHGD REFGFIMSRM RLNRKIVVGY WEQDDVLAKI ADWSRVSVAA YELKTMKVVR
FGDNMRQVAV TDGDKVAAEI TFGMSVNTHG IGDLVAVINQ ISDAEIDQLV TEYADSYTLM
DSLLRGGAQH SSLRDAAKIE LGLRAFLKDG NFTAYTDTFE DLHGMTQLPG IASQRLMADG
YGFGGEGDWK TAAMVRTMKV MASGLPGGNS FMEDYTYHFD PSNPLVLGSH MLEICPSIAA
DKPTCEIHPL GIGGKADPVR LVFNAPAGPA INVSLIDMGN RFRILVNEVE AIDVPEALPK
LPVARAIWKP MPDMQTGCAA WILAGGAHHT VYSQNLTTDH IEDFADIFGV ELVVIDRNTN
LRQLKNELRW SEVYYK