Gene Slin_1237 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1237 
Symbol 
ID8724970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1508876 
End bp1510417 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content53% 
IMG OID 
Productalpha-L-arabinofuranosidase domain protein 
Protein accessionYP_003386086 
Protein GI284036156 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.120274 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAC AAATCCTAAC CCTATCCTTT GCCTGTCTGG CTCTTTTTGC GTCGGCCCAG 
AACAAAGTGA CGATCAATGC CGGAGACGGC AAAACGGTCA TCAGTAAACA TATTTACGGC
CACTTCGCCG AACATTTGGG CCGCAGCATC TACGATGGCT TTTATGTAGG CGACAATAAC
ACCAAAATCC CCAACAAAAA CGGGGTTCGG CTGGACGTTG TCAACGCCCT GAAAAAGATG
AAAATCCCGA ACCTGCGCTG GCCGGGGGGC TGCTTTGCCG ACACCTACCA TTGGAAAGAT
GGTATCGGCC CGAAAGCGAA ACGCCCAAAA ATTGTGAATA CCTGGTGGGG AGGTGTTACG
GAAGACAATA GCTTTGGTAC GCACGACTTC CTCAACATGT GCGAACTGCT GGGCACCGAG
CCGTATCTGG CCGGAAACGT GGGTAGTGGC ACCGTGCAGG AATTGTCGGA CTGGGTTCAG
TACGTCAATT TCGAGAAGAA CAGCCCGATG GCGAACTTAC GTCAGCAAAA TGGTCGTCAG
GCTCCCTGGA ACGTGAAGTA CTGGGGCGTA GCCAACGAAG CCTGGGGCTG TGGCGGCAAT
ATGAAGCCCG ATTATTACGC CAACCTGTAT CGCCAGTACA GCACCTTTAT GAACAACAAG
GTGGGCGATG GCAAGATTTT CCGGATTGCA TCGGGAGCCA GTGACAATGA CTACACCTGG
ACCGAAACGC TCATGAAAAA CATTCCGTCT ACCATGATGG AAGGGCTGGC CATGCACCAT
TACTCGGTGC TCTCGTGGGG CGAAGGCAAG AAAAGCTCGG CGACTCAGTT TACGGACGAG
GAGTACTTCA AAACGATGCA GCAAGCCCTG TTGATGGACG AACTGATCGA GAAACATTCG
GCCGTGATGG ACAAGTACGA TCCGGAGAAG AAGGTGGCCC TCATTGTCGA TGAGTGGGGC
GGCTGGTACA ATGTGGAGCC GGGTACCAAT CCGGGATTCC TGTATCAGCA AAACACCATG
CGGGATGCCG TACTGGCCGG GTCCACCCTC AACATTTTCC ACAAACACGC CGAGCGCGTG
CGCATGGCCA ATTTGGCCCA GGCCATCAAC GTGCTACAGG CGGTGATCCT GACCAAAGGC
GACAAAATTC TGCTGACGCC AACGTATCAC GTGCTGGAAA TGTACAACGT ACACCAGGAT
GCTACCCTGC TGCCGGTAAG CGTAAAATCC GACGATTTCA CGTTTGGGAA AGACAAACTG
CCGGCCGTGT CGGTGTCGGC CTCCCGCGAC AAAGCCGGGA AAGTCCACAT TTCGCTCGTG
AACATCGACC CAACGAAGCC ACAGGAAATT TCGGTCGATC TAAACGGCAT CAAATCATCC
GGACTAACGG GCCGCATTCT CACTTCGGCT AACGTACGCG ATCACAACAC GTTTGAGAAT
TTAACGAAAA TAAAGCCGGT CGCTTTTAAC GGGGCAAAAC TTAGTAGCGA CAAACTGACC
GTAACCCTGC CTCCGGTATC GGTGGTGGTG CTGGAACTTT AA
 
Protein sequence
MKKQILTLSF ACLALFASAQ NKVTINAGDG KTVISKHIYG HFAEHLGRSI YDGFYVGDNN 
TKIPNKNGVR LDVVNALKKM KIPNLRWPGG CFADTYHWKD GIGPKAKRPK IVNTWWGGVT
EDNSFGTHDF LNMCELLGTE PYLAGNVGSG TVQELSDWVQ YVNFEKNSPM ANLRQQNGRQ
APWNVKYWGV ANEAWGCGGN MKPDYYANLY RQYSTFMNNK VGDGKIFRIA SGASDNDYTW
TETLMKNIPS TMMEGLAMHH YSVLSWGEGK KSSATQFTDE EYFKTMQQAL LMDELIEKHS
AVMDKYDPEK KVALIVDEWG GWYNVEPGTN PGFLYQQNTM RDAVLAGSTL NIFHKHAERV
RMANLAQAIN VLQAVILTKG DKILLTPTYH VLEMYNVHQD ATLLPVSVKS DDFTFGKDKL
PAVSVSASRD KAGKVHISLV NIDPTKPQEI SVDLNGIKSS GLTGRILTSA NVRDHNTFEN
LTKIKPVAFN GAKLSSDKLT VTLPPVSVVV LEL