Gene Slin_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4226 
Symbol 
ID8727985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5094081 
End bp5095334 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content52% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003389010 
Protein GI284039080 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGG AGCCAATTGG AATTTTATTA CTTTTGCTGC TGGCGACTCC TGTGCTGTTG 
GCGCAGACGC CCACTTATAC GGCCGATATT CAGCCCATTC TGGCTCATCA TTGTGCTCCC
TGCCATCACC CGGGCGGTTT AGGGCCGTTT AGCCTGCTGA CCTACGAAGA CGTAGCCAAA
CGAGGTAAAT TTATTGCCAA AGTCACCCAG ATTCGATACA TGCCGCCTTT TCCAGCCGAC
CGGCAGTTTC AGCATTATGC GAACGAGCGG GGGCTGTCAG AAGCCGAAAT CAATACCATT
CAGGCCTGGG TGCAGGGGGG GATGGTACAA GGTAAAGAGG TACGGGGAAA GGATAATAGG
GTGGGGGCGA AAGGAGGTGC GATCCAATCC GAACGCAATG CCCGAACGCC GGACCTTGTG
CTGCGTATGA AGCCTTACAA TATTAAGGGC GACGTGCAGG AGGACTTTCG GTATTTCCAC
GTACCCATGG GCTTAACGCA GGACATATGG GTCGAAGCCG TTGAGTTTGT ACCTGGTAAC
CGTAAGTTAC TCCACCACAG TCGGCTTATG ATCGACTCTA CGGGCACGAT GGCCGGTATT
GACGGCATAA GTGAAGAGGA CCCCCGACTG CGGGAATTTC AGAAAACACC GCTGGCCGAT
GAGTTTCTGT ATGGATGGGT GCCGGGTAAT GACCGGGTAA CATTCCCGGA GGGAGCGGCC
AAGCGAATTC GGGCGGGTAG CGACCTTATT CTTAATATAC ACTATGCTCC GTCGGCAAAG
GCCGATCAGG ACCAGTCTGA AGTGAGGTTG TATTTTGCCC GAAAACCAGT GGAACGGGTC
GTGAAAACAC TTACCCTTAC GGAGAATAAT GTGACCAATC AACCCTTTCA ACTGCCTGCC
AATACAAAGC CGACGTTTTT TATGAACTAC GGCCCGCTAC GCGATACAGT CCGCCTTCTA
TCCGTTTTAC CCCACATGCA TCGATTGGGG AAATCGGTTC GGGCGTTTGC CATTACCCCC
GATGGGGATG TGATCAATCT CATAAAGATT GATGCCTGGG ATTATAACTG GCAACTGTCT
TACTTCTTTC AAACCCCGCT TGTGTTGCCT AAAGGGGCTA CTATCATTGC CGAAGCCAGT
TACGACAACA CAGACCAAAA CCCCCTCAAT CCAAACCGGC CTGCCCGAAC GGTGGGCTAC
GGCTGGAACT CGACCGATGA AATGATGAAT CTGGTCTTCT ATTACATAAA GTAG
 
Protein sequence
MKKEPIGILL LLLLATPVLL AQTPTYTADI QPILAHHCAP CHHPGGLGPF SLLTYEDVAK 
RGKFIAKVTQ IRYMPPFPAD RQFQHYANER GLSEAEINTI QAWVQGGMVQ GKEVRGKDNR
VGAKGGAIQS ERNARTPDLV LRMKPYNIKG DVQEDFRYFH VPMGLTQDIW VEAVEFVPGN
RKLLHHSRLM IDSTGTMAGI DGISEEDPRL REFQKTPLAD EFLYGWVPGN DRVTFPEGAA
KRIRAGSDLI LNIHYAPSAK ADQDQSEVRL YFARKPVERV VKTLTLTENN VTNQPFQLPA
NTKPTFFMNY GPLRDTVRLL SVLPHMHRLG KSVRAFAITP DGDVINLIKI DAWDYNWQLS
YFFQTPLVLP KGATIIAEAS YDNTDQNPLN PNRPARTVGY GWNSTDEMMN LVFYYIK