Gene Slin_3029 details

Gene Information

Locus tag: Slin_3029
Symbol:
ID: 8726781
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3664646
End bp: 3667912
Gene Length: 3267 bp
Protein Length: 1088 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: ASPIC/UnbV domain protein
Protein accession: YP_003387839
Protein GI: 284037909
COG category:
COG ID:
TIGRFAM ID:
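
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends with a stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 3664646
END_BP = 3667912
REPORTED_GENE_LENGTH = 3267     # bp
REPORTED_PROTEIN_LENGTH = 1088  # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: one per codon, minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length == REPORTED_GENE_LENGTH)                     # True
    print(protein_length(length) == REPORTED_PROTEIN_LENGTH)  # True
```

Under these assumptions the reported gene length (3267 bp) and protein length (1088 aa) are consistent with the listed coordinates.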


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0208708
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTGA TTCGTTTTTT AGCCTTTTTG CCACTTTTAG CGGCTTGTAA AGGAGGTTCG 
TCTGATTCAG TTTTTACCCT GCTGAGCGCC AGCCAGACAC ATATTGATTT CGTAAACTCG
ATCCAGGAAA CTGAAGAAGA TAACGTTCTC AATTACGAGT ACTTCTACAA TGGTGGGGGG
GTAGCTGCTG CCGATTTCAA TAATGACGGA CTGATTGACC TGTACTTTAC CGCCAACCAG
GGGGAGGATA AACTGTATCT GAACGAGGGA AAATTATCCT TTAAAGACAT AACCAAAGAA
GCGGGCATCG ACTGGAAAGG CGAGTGGAAA ACCGGGGTCA CGGTTGTTGA TATCAATAAA
GACGGCTGGC AGGATATGTA TGTGTCGGTT TCGGCCAATA TCGATAAGCC AGCTCTGCGG
AAACACAAAC TCTATATCAA TAACAAGACG CTCAAAAACG GCGTACCCTC ATTCACCGAA
CAGGCAGCCG CCTATGGGCT CGACCTGACG ACTTACGCCA CTCAATCCGC TTTCTTCGAC
TACGATAACG ATGGCGACCT CGATGTCTAC CTGCTCAACC ATAATGTAAA AGATTTTAAA
CGCTTTGATG CAGATGCCGT TCATGCCATG CGCGACTCAC TGGCTGGACA TCGGCTCATG
CGCAATGATG GCGGAAAGTT TGTTGATGTA AGTGTACAGG CTGGTATTAA GGGCAATCCG
ATCGGTTTCG GATTGGGTGT ACACACGGCC GATCTCAATG GCGATGGCTG GCTCGATATC
TACGTTTCCA ACGACTATGT GGAAGAGGAT TATCTGTATC TGAATAACAA GAACGGTACG
TTTACTGATG TTGTAAAGGA AGCCACAGGA CATGTGAGTT ACTTTTCGAT GGGCAATGAT
GTAGGCGACA TCAACAACGA TCTGCTCCCC GATATTGTCA CAATGGATAT GCTGCCGGAG
GACAACAAGC GACAGAAACT GCTCTTTGGA CCCGATAAAT ATGAGGCTTA TCTGTCCATG
CTCCGGAACG GGTTTCACCC CGAAGTGATG CGGAACATGC TGCAGCTCAA CAACGGTGTC
GATCGGCAGG GTAGACCTCA ATTTAGTGAA ATAGGGCAGT TGGCGGGTAT TGCCAGTACA
GACTGGAGCT GGTCGGCCTT ACTGGCCGAC TATGATAATG ACGGTTATAA AGACCTGTTT
ATCACCAACG GCTATCTGCG CGATTATACC AACAACGATT TTGTTAAATA TTACGCCGAT
CAGGGAGCGC GCAAAAATCA GAGCGTGATG GAGGTGATTA GCCATATGCC ATCGACCAAA
ACGCCTAATT ATATATTCAG AAATGAGCAT AACCTGACCT TCTCTAACAA ACAGACTGAT
TGGGGCTTCG ATACGCCGGT TATTTCGAAT GGTGCCGTTT ATGCCGATCT TGATAATGAC
GGCGATCTGG AAATAGTGAC CAACAACATC AACGAAAAAG CCCACCTCTA TCAAAACCAA
ACGGCCGAAA AAACAGGTAA TAACTATGTT GACATTGTCC TGAACCCCAA ACAGGCCGCT
CAGTCGGCGA CGGGGACGAA GGTGTATGTA TATAGTGGAG ACTTGCGTCA ATTTCAGCAA
TACACACCAA CGCACGGGTT TCAAAGCAGT ATGATGATTC CCATGCACGT TGGTTTGGGG
AAGGCAAAAA CCATCGATAG CCTGGTGGTT GTCTGGCAGA ATGGTTCGGT GCAGAAGCTA
CGGAACGTAG CCGTAAATCA GCGGCTCACC ATCAATTATG AACCCGGTAC AGAAGCGAGT
ACACCCATTT TGTCTCAGCC TGCTCTGCTC TTCGCGCAAA CGAACACGCT CGACTTTCAA
CATCAGCAAG CACCCCTCAA TGATTTCAGC CGACAACTTT TGCTGCCGCA TATGTATTCA
TACGCGGGTC CACGCATGGT GAAGGGCGAC GTCAATAAAG ATGGACTGGA CGATATTTAC
ATTGGGGGTG GCAAAGGTCA GTCAGGTGAA TTATTCATTC AGCAAACGGG TGGACGGTTT
GAGAAAAGCG CTCAGAATGC CTTTAAACAG GATGCCCTTT GTACCGATAC CGACGCAGCT
TTTCTGGATG CTGACAGCGA TGGCGACCTG GATTTGTATG TTACGAGTGG TGGGTACGAG
TACCTTCCCA ATGACCTCCT GCTCCAGAGC CGTCTGTACC TGAATGACGG CAAAGGAAAT
TTTAATAAAG ATGCCAGCCG CCTTGATCTA AACGACTATG CCGGTAACGC TGTAGAAGTG
CTTGATTTCG ATAAAGACGG CGATAGCGAT TTATTTGTGT GTGGGTCCGT TATGCCCAAT
CAATACCCCC GGTATCAGAC GAGTCGGTTG TACCGCAACG AGAAAGGGAA ATTCGTGCCC
GTAAAAAATG ACGCGTTCAA TGACCTGGGC CTGCTCACGG ATGCCTGCGT AGTGGATTTT
GATAAAGACG GGTTCGATGA TCTGGTGACC GTTGGGGAGT GGACACCTAT TATTCGCCTG
CGCAATGATC ATGGCGTATT CAAACGGGTG CAGGATGAGC TGGACCAGAC AACCGGTTTC
TGGCAACGTA TCATCGGTGG TGATTTCGAT AAGGATGGCG ATATCGACCT GATTGCCGGA
AACTATGGTC TCAACTGCCA CTTCAAAGCC TCGCCCGCAT TACCGCTCAG TATGTTGACC
GATGACTTCG ACGGAAATGG CACCATAGAT CCCATCGTTT GCTACTATAT ACAGGGGACA
AACTACCCCG CTTATAGTCG GGATGAGTTA TTGGACCAGT TGGCCCCCCT TCGGAAGAAA
TACACCTCCT ATGCCCTGTA CTCCGACGCC ACAGCGGACG AGGTAGTAAA TGAGTTTAAG
GGAAAAACAC CTGCCCGAGC AACCATCAAC GAGTTGTCAA CCCTGTATCT GGTAAACAAT
AAGGGACATT TTGAACGCAA AGAATTGCCT ATACAGGCTC AGTTCTCGCC AGTTTATGCG
ATGGCAACAC CGGACGTCAA TGAAGATGGC TTTCCTGACC TGCTGCTCGC AGGGAACCAG
ACCCACGGGC GTGTGCGAAC GGGCAACATC GATGCCAATT ACGGACAGGT ATTTGTGAAT
GACCGCAAGG GAGGTTTTAC ATACATGCCT CAGTCACAAT CAGGCCTGTT CTTGCGGGGA
AATGTCCGCT CACTGCTCGT TGTCAACAAC CAGCTTATGG CGGGTATCAA TAGCGAAAAG
GTACAGGTTT ACACGAAAGC GAAATAG
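
The GC content field can be recomputed from a gene sequence printed in 10-base blocks like the one above. This is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases. The record does not say how its 49% figure was computed or rounded, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First block of the gene sequence above: 3 of its 10 bases are G or C.
    print(gc_percent("ATGAAGTTGA"))  # 30.0
```

Passing the full gene sequence as a single string gives a value that can be compared with the GC content field.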
 
Protein sequence
MKLIRFLAFL PLLAACKGGS SDSVFTLLSA SQTHIDFVNS IQETEEDNVL NYEYFYNGGG 
VAAADFNNDG LIDLYFTANQ GEDKLYLNEG KLSFKDITKE AGIDWKGEWK TGVTVVDINK
DGWQDMYVSV SANIDKPALR KHKLYINNKT LKNGVPSFTE QAAAYGLDLT TYATQSAFFD
YDNDGDLDVY LLNHNVKDFK RFDADAVHAM RDSLAGHRLM RNDGGKFVDV SVQAGIKGNP
IGFGLGVHTA DLNGDGWLDI YVSNDYVEED YLYLNNKNGT FTDVVKEATG HVSYFSMGND
VGDINNDLLP DIVTMDMLPE DNKRQKLLFG PDKYEAYLSM LRNGFHPEVM RNMLQLNNGV
DRQGRPQFSE IGQLAGIAST DWSWSALLAD YDNDGYKDLF ITNGYLRDYT NNDFVKYYAD
QGARKNQSVM EVISHMPSTK TPNYIFRNEH NLTFSNKQTD WGFDTPVISN GAVYADLDND
GDLEIVTNNI NEKAHLYQNQ TAEKTGNNYV DIVLNPKQAA QSATGTKVYV YSGDLRQFQQ
YTPTHGFQSS MMIPMHVGLG KAKTIDSLVV VWQNGSVQKL RNVAVNQRLT INYEPGTEAS
TPILSQPALL FAQTNTLDFQ HQQAPLNDFS RQLLLPHMYS YAGPRMVKGD VNKDGLDDIY
IGGGKGQSGE LFIQQTGGRF EKSAQNAFKQ DALCTDTDAA FLDADSDGDL DLYVTSGGYE
YLPNDLLLQS RLYLNDGKGN FNKDASRLDL NDYAGNAVEV LDFDKDGDSD LFVCGSVMPN
QYPRYQTSRL YRNEKGKFVP VKNDAFNDLG LLTDACVVDF DKDGFDDLVT VGEWTPIIRL
RNDHGVFKRV QDELDQTTGF WQRIIGGDFD KDGDIDLIAG NYGLNCHFKA SPALPLSMLT
DDFDGNGTID PIVCYYIQGT NYPAYSRDEL LDQLAPLRKK YTSYALYSDA TADEVVNEFK
GKTPARATIN ELSTLYLVNN KGHFERKELP IQAQFSPVYA MATPDVNEDG FPDLLLAGNQ
THGRVRTGNI DANYGQVFVN DRKGGFTYMP QSQSGLFLRG NVRSLLVVNN QLMAGINSEK
VQVYTKAK
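
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the standard NCBI amino-acid string. Translation table 11 shares these codon assignments and differs only in which codons may act as start codons. Alternative start codons are not handled here, which is harmless for this gene because it begins with ATG. The `translate` helper is illustrative and not part of any library.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAAGTTGA TTCGTTTTTT AGCCTTTTTG"))  # MKLIRFLAFL
    # Final 27 bases, ending in the TAG stop codon.
    print(translate("GTACAGGTTT ACACGAAAGC GAAATAG"))     # VQVYTKAK
```

Both fragments reproduce the corresponding ends of the listed protein sequence.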