Gene Slin_4623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4623 
Symbol 
ID8728387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5622665 
End bp5625091 
Gene Length2427 bp 
Protein Length808 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003389400 
Protein GI284039470 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.417059 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTAACCA ACTATCTCAA AATCGCCTTT CGCAACCTCT GGAGAAACCG GGGGTATGCG 
GCTATCAATG TCGTTGGCCT GGGTGTGGCT TTCTGCATCT GCGCCTTTCT GTTACTGACA
GCCTACCTGC ACCTCACCTA TGATTCGTTT CATCAGGATG GCGACCGGAT TTTTCAGGCG
TACCTCTTCA GCAACGACCC CGAAAAACCA ACAAAAAGCG GGACATTACC ACTGCCCCTT
GCCCCCGCCC TTCATACCGA TTTTCCCGAA CTGGAAGCCA CCCGCATTAT GGCCGGTCGC
AAAAGTCTGG TCGAGGCCAA CGGTAAGTAC TTCGACAAGC TGATCAATTT CACGGACCCA
TCGTTCCTGA ACATCTTTTC GTTTCCGCTC CTTGTCGGTA ATCGGAAAAC GGCCCTACGC
GAGTTGAGCA GTATTGTGAT TAGCGAGAAC ATGGCAAAGG ACCTGTTTGG AACTTCCAAT
CCCGTAGGTA AGCGCCTGCG AATGGGTAAC GATGGCCGTC AGAAAGATTA TATCGTAACA
GGTGTTGTGG CCGATGTGCC CGACAACTCG TCGGTTCGAT TTGATGCCCT GGCGCGTATT
GAGAACGCCC CAAATTATCA GGACAGCAAA GATAAGTGGG ACGCTAACTC GCATACGGTA
TTCGTAAAAT TACCGGCTCA GGTCGATCAG GCTACGTTTG AAGATCGGCT GAAACCCTTT
GCGCAAAAGT ATTATCAAGG CGCTATCGAG GAGTTAAAGA AGAAAAAGGC CCAGCCCGAT
TCACGCGGTG ATTTATTTGC TGTGCGCTTA CAGAAGCTGG CGAATGTCCA TTTCAACCGG
GACTTAAGCG ACAGCAAAGG CACACCCATT GCGGTAATTT ACGTGCTGAT GGGCATGGCT
TTTTTTATTC TGGCCATTGC CTGCATCAAC TTCATCAACC TGAGCATTGC GCGGTCCTTC
ACCCGAGCGC GGGAGATCGG CGTTCGTAAA TCGCTGGGTG CGCTCAAAAG CAGTTTGTTC
GTCCAGATTT GGAGCGAATC GGGGTTGATC TGCATCGTAG GCTTTCTGGC TGGGGCGGTA
CTGGCTTACC TGCTCATGCC TGCCTTCAAC GCTCAGTTCG GGGCCAAACT CAAGCTGGCG
TATGCGCTTC AACCCGGCTT TATCGCCCTC TACGGATTTG TCATTCTGCT GGTTACGCTG
GTAGCGGGTG GGTATCCGGC CTGGCAAATG GCGAAGTTCA ATACGGTGGA TGTGCTCAAG
GGGAAAGTGA CGACCAAACG GCCGGGCGTG TTGCGAAATG CGCTAATTGT CACCCAGTTT
ACGCTCTCCT GTCTGCTGGT TTGCTGCACC GTCATTGCCT TTCAGCAGGT AGGTCATTTG
CGCCAAAGTC CGCTCGGTTT CGACAAAGAG CAGGTGATCA GTATACCGGT CGGCACGCAG
GTTGACGGTC GGCAGGTGTT GCAGAGATTA CGGAATAAAC TGGCCAATGA TCCAACAGTA
TTGGCCCTGA CAGGCACCAG CACGAACCTC GGCAGAGGCA AAGACCGCGT GAGTTCGAGA
TCGACCGTCG GGTATACCAA CAAGGGAAAG AGGATCTCGA CCGACATTTT GATGATCGAC
TATGACTACC TCAAAACGCT TAAAATAAAG CTGTTGGCCG GGCGTGATTT TAACCGGGCC
TATGCCAGCG ATTCGGTTAA TCGGGTCATC GTTACACAGA GCATGGCCAA GATGATGGGC
GTGACCAACC CGGTCGGTAT GTTGCTCGGC GATGACGAAG ACACGACCGG CACCAAATCC
CAGATCATCG GCGTCGTGCC CGATTTTCGG CTTTATTCCG TGGCCGACAA CGCCAATCCG
ATCACCATGT ACCTATCGGC TACCGAACCA ATTCACTACG TTTTTGTGCG GGTAGCCCCG
CAAAGTTTGG GTGGGGCTAT GGCGAAGCTT CAGGAAGTGT GGGCCGAGGT GGCTCCACAA
TCGGAGTTTA TGGGTTCGTT TCTGGATGAA AACGTCGACG CCTGGTATCA GAACGAAGAA
CAGCTTTCGC AGATATTAAG TCTGGCGTCA AGCGTGGCTA TTTTGCTGTC GTGCATCGGT
TTGTTTGCTA TTGCCCTGCT CATGGTCGAA CAGCGAACGA AGGAGATCGG TATCCGAAAA
GTAATGGGAG CCAGCATTCC CGGCATTGTC CTGATGCTCT CGCGTGGCTT TGTCAAACTG
GTGTTGATTG CGCTGTGCAT CGCCGTGCCG TTGGCGTGGT TTGGCATGCA AACCTGGCTG
AATAACTACT CCTATCGCAT CGATATCAGC CCGTGGGTAT TCATAGGCGT TGGCCTGTCG
GCCATCTTTA TTGCGCTGGC AACGGTGAGT TTTCAGAGTA TCAAAGCCGC GCTGATGAAC
CCGGTGAAAT CATTACGATC GGAGTAG
 
Protein sequence
MLTNYLKIAF RNLWRNRGYA AINVVGLGVA FCICAFLLLT AYLHLTYDSF HQDGDRIFQA 
YLFSNDPEKP TKSGTLPLPL APALHTDFPE LEATRIMAGR KSLVEANGKY FDKLINFTDP
SFLNIFSFPL LVGNRKTALR ELSSIVISEN MAKDLFGTSN PVGKRLRMGN DGRQKDYIVT
GVVADVPDNS SVRFDALARI ENAPNYQDSK DKWDANSHTV FVKLPAQVDQ ATFEDRLKPF
AQKYYQGAIE ELKKKKAQPD SRGDLFAVRL QKLANVHFNR DLSDSKGTPI AVIYVLMGMA
FFILAIACIN FINLSIARSF TRAREIGVRK SLGALKSSLF VQIWSESGLI CIVGFLAGAV
LAYLLMPAFN AQFGAKLKLA YALQPGFIAL YGFVILLVTL VAGGYPAWQM AKFNTVDVLK
GKVTTKRPGV LRNALIVTQF TLSCLLVCCT VIAFQQVGHL RQSPLGFDKE QVISIPVGTQ
VDGRQVLQRL RNKLANDPTV LALTGTSTNL GRGKDRVSSR STVGYTNKGK RISTDILMID
YDYLKTLKIK LLAGRDFNRA YASDSVNRVI VTQSMAKMMG VTNPVGMLLG DDEDTTGTKS
QIIGVVPDFR LYSVADNANP ITMYLSATEP IHYVFVRVAP QSLGGAMAKL QEVWAEVAPQ
SEFMGSFLDE NVDAWYQNEE QLSQILSLAS SVAILLSCIG LFAIALLMVE QRTKEIGIRK
VMGASIPGIV LMLSRGFVKL VLIALCIAVP LAWFGMQTWL NNYSYRIDIS PWVFIGVGLS
AIFIALATVS FQSIKAALMN PVKSLRSE