Gene Slin_2884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2884 
Symbol 
ID8726634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp3485068 
End bp3486765 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content52% 
IMG OID 
Productalpha amylase catalytic region 
Protein accessionYP_003387696 
Protein GI284037766 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAC TGATTATCTA TCAGATTTTT ACCCGTCTGT TTGGCAATCA AAATACCACC 
AACATATACA ATGGCACGTA TGCAGAAAAC GGTGCCGGTA AATTCAACGA TATCAACGAA
GCAGCCTTGC AGTCGATCCG TCAGTTTGGC GTTTCGCACG TCTGGTTTAC CGGAATAGTG
GAGCACGCTA CGCAAACCGA CTACTCAACG TTCGGTATTA AACCCGACGA CCCGGCAGTT
GTAAAAGGTC GGGCAGGCTC CCCCTATGCT GTTAAAGATT ATTACGACGT CGACCCTGAC
CTGGCGGTTG ATGTGCCCAA TCGAATGGCC GAATTTGAGG CCCTTGTTCA ACGTACCCAC
GCTCACGGCC TGAAAGTTAT CATCGACTTT GTCCCGAACC ACGTAGCCCG GCAATATGGG
TCGGATGCCC GGTCCGGTGA CGTTGTCGAT CTGGGTGAGA CCGACGATAC TTCGGTCCAT
TTTTCACCGA ACAACAATTT TTATTACCTG CCGGGTGAGC CCTTTGTTGC TCCGGAAGCC
GGGAACATAC CTCACGCTGG TTCACCTATT CACGAATACC CGGCCAAAGT GACCGGCAGC
GGCTCTATAA CGGCCCGGCC CGACATCAAC GACTGGTACG AAACCGTTAA GCTTAACTAC
GGTATCAATA TATTCGACGG GAGCCGTCAT TTCGATCCCA TTCCGGATAC CTGGCACAAG
ATGCTCGATA TTCTGTTTTT CTGGGCGGCT AAAGGAATTG ACGGGTTCCG ATGTGATATG
GCGCACATGG TTCCGGTCGA GTTTTGGCAC TGGGCCATTG GCCGGGTCAA ACAACGGTAT
CCCCGCCTGA TTTTCATCGC TGAAATTTAT GATCCCGGCC TTTACCGCTC TTTCATTTTT
GAAGGTGGGT TCGATTATCT CTACGATAAG GTAGGCCTAT ATGATGCGCT CCGGCGGCTT
ATGGAGGGCC ACGGCTCCTG CTACGAGCTA ACCCGTGTGT GGCAGCAGGA ATCGGGCGAT
TTCGCCCAGC ACATGCTCCG GTTTCTGGAA ACCCATGACG AACAGCGCAT TGCGTCGCGT
TTTTTTGCCA ACGACCCCTG GAGCGCTGTC CCTGCCATGA CGCTGACGGC CACCATGCAC
ACCGGCCCAA CGCTGCTTTA TTTCGGACAG GAAGTAGGGG TCCGGGCCGA AGGGTCGGAA
GGCTTCAGTG GCGATGATGG AAGGACGACC ATATTTGATT ACTGGGGGCT GACCGACTGG
CAGGGCTGGC TCAACAAAGG TCGCTATGAT GGAGCGGGAC TAACCGACGA CCAGCGGAAC
CTACGCTCTT TCTACCAACG GTTAAACCAC CTCGTAAACG GGTCCGATGC CATTCAGAAC
GGCTATTTCT ATGACCTACA ATACGTAAAC GATAATGGAC AGAGCACGGG TTACGATGCC
CATCGCGTAT ACAGTTACCT GCGGTACACA GATCGCCAGA AACTGTTGAT CGTCTGTAAT
TTTTCTCAGC ACCTAACGTA CGAAACCAAC ATCAAAATTC CAGAGGCAGC GTTCGATGCC
ATGGGTATAA ACCCTTCCAG AACGCTGCGG CTGAGCGATA TTTTCCTGAC AGATATGCAG
CTTGAAGCAG TTGGACGTAG CGGCATCCCC CTGGAGCTTC CTCCTCGTTG TGTGCGGGTG
CTGGAAATAA AATTGTAG
 
Protein sequence
MDKLIIYQIF TRLFGNQNTT NIYNGTYAEN GAGKFNDINE AALQSIRQFG VSHVWFTGIV 
EHATQTDYST FGIKPDDPAV VKGRAGSPYA VKDYYDVDPD LAVDVPNRMA EFEALVQRTH
AHGLKVIIDF VPNHVARQYG SDARSGDVVD LGETDDTSVH FSPNNNFYYL PGEPFVAPEA
GNIPHAGSPI HEYPAKVTGS GSITARPDIN DWYETVKLNY GINIFDGSRH FDPIPDTWHK
MLDILFFWAA KGIDGFRCDM AHMVPVEFWH WAIGRVKQRY PRLIFIAEIY DPGLYRSFIF
EGGFDYLYDK VGLYDALRRL MEGHGSCYEL TRVWQQESGD FAQHMLRFLE THDEQRIASR
FFANDPWSAV PAMTLTATMH TGPTLLYFGQ EVGVRAEGSE GFSGDDGRTT IFDYWGLTDW
QGWLNKGRYD GAGLTDDQRN LRSFYQRLNH LVNGSDAIQN GYFYDLQYVN DNGQSTGYDA
HRVYSYLRYT DRQKLLIVCN FSQHLTYETN IKIPEAAFDA MGINPSRTLR LSDIFLTDMQ
LEAVGRSGIP LELPPRCVRV LEIKL