Gene Slin_3971 details


Gene Information

Locus tag: Slin_3971
Symbol:
ID: 8727729
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4768796
End bp: 4770583
Gene Length: 1788 bp
Protein Length: 595 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003388760
Protein GI: 284038830
COG category:
COG ID:
TIGRFAM ID:
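
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4768796
end_bp = 4770583
gene_length_bp = 1788
protein_length_aa = 595

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the expected residue count matches the listed protein length.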


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000911124
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATTT TTGAATTCCT GATACCCCGA TTAGCCAGCA CGCTACCCCG GCGTTCACTC 
CGCTCCGGCT GGCTTTCGGC CGTGCTTTTT CTTCCACTTT CGGTTGCCTC AAACACGCCC
GGTCAGCCAC CCGCTGCTGC CACACCGAAG ACCGTGGTGC GGCAGACCGA GGTAAAGGAA
TCCATGGTGC AAAAGGCCGT GGTGAGGCAG GCCGATGTCA TTATTTACGG AGGTACATCG
GCGGCTGTTA CGGCGGCTGT TCAGGTAAAG AAAATGGGCA AATCGGTCAT CATCGTTTCG
CCGGATAAAC ACCTGGGTGG CCTTTCGTCA GGTGGACTTG GCTTTACCGA TACCGGAAAC
AAAGAAGTGA TCGGGGGCCT CTCGCGCGAG TTTTACCAGC GACTGTACCA GCACTATCAG
CAGAAAGACT CCTGGAAATG GCAGAAGCCC GAAGAGTACG GCAACAAAGG CCAGGGTACC
CCCGCCATTG ACGGTACCAG CCGGACCATG TGGATTTTTG AACCCCATGC CGCCGAACAG
GTCTTCGAGG ATTTTGTCAA AGAATATAAA CTGACCGTTT ACCGCGATGA ATGGCTCGAC
CGCTCGGCCA AGGGATTAAC CAAACAAGGC GGTAGCATAC GCTCGTTCCG GACGTTGAGC
GGAGCCGTCT ATCAGGGAAA AATGTTCATC GATGTCACGT ACGAAGGCGA TCTGATGGCC
GCTGCGGGTG TAAAATACCA CGTGGGCCGG GAAGCCAACA GCGTGTACGG CGAAAAATGG
AATGGCGTGC AGAAAGGCGT TTTTCAGCAC GGACACCACT TCAAAACCAA CATCAGCCCA
TATAAAATAC CGGGCGATCC GGCTAGCGGA CTTCTCCCTG AAATATCGGC AGAACAACCC
GGCGAGAACG GAACCGGCGA CAATAAGGTC CAGACCTACT GCTTCCGGAT GTGCCTGAGC
AACCACCCCG ACAACCGGGT GCCCTTTCCG AAACCCGAAG GCTACAACCC CGACCGCTAC
GAACTGCTGG CTCGGGTATT TGCGGCCGGC TGGCGCGAAA CCTTCGATAA ATACGACCCT
ATTCCCAACC GCAAAACGGA CACCAACAAC CACGGCCCGT TCAGTACGGA CTACCTCGGC
AAAAACTACG ATTATCCAGA AGCGACCTAC GCCCGCCGGA AGCAGATCAT TAAGGACCAT
GAACTGTACC AGAAGGGCCT GATGTACTTT CTGTCCAATG ACCCACGCGT TCCGGCCGAT
GTTCACCAGG CCATGCAGCA GTGGGGCCTG GCGAAAGATG AGTTCACCGA TAACGGAAAC
TGGCCGCATC AACTCTATAT TCGCGAAGCC CGACGCATGC TGGGTGTCTT TGTCATGAAA
GAAGCCGACG CGCTTGGCAA AACAACCGTG CCGCAGCCCA TTGGCATGGG CTCCTATTCG
CTGGATGCCC ACAATGCCCA GCGGTACGTT AAACCAGATG GTTTCGTACA GAACGAAGGC
GACATTGGCG TGCATCCTGA CAAACCCTAT TCGATTGCTT ACGGGTCTAT CTTGCCCAAA
GAAACAGAGT GCAACAACCT GCTGGTGCCG GTTTGTGTAT CCAGTTCGCA CATCGCCTAC
GGCTCCATTC GGATGGAACC CGTATTCATG ATTCTGGGCC AGTCGGCAGC CACGGCGGCT
GTGCTGAGCA TCGACAACAA AGTGAGTCCG CAACGGCTGC CCTACGAGAC GTTGAGCAGG
GTGTTGTTAA ATGATAAACA GCGGTTAACG CTGAATGCAG CAAACTAA
 
Protein sequence
MNIFEFLIPR LASTLPRRSL RSGWLSAVLF LPLSVASNTP GQPPAAATPK TVVRQTEVKE 
SMVQKAVVRQ ADVIIYGGTS AAVTAAVQVK KMGKSVIIVS PDKHLGGLSS GGLGFTDTGN
KEVIGGLSRE FYQRLYQHYQ QKDSWKWQKP EEYGNKGQGT PAIDGTSRTM WIFEPHAAEQ
VFEDFVKEYK LTVYRDEWLD RSAKGLTKQG GSIRSFRTLS GAVYQGKMFI DVTYEGDLMA
AAGVKYHVGR EANSVYGEKW NGVQKGVFQH GHHFKTNISP YKIPGDPASG LLPEISAEQP
GENGTGDNKV QTYCFRMCLS NHPDNRVPFP KPEGYNPDRY ELLARVFAAG WRETFDKYDP
IPNRKTDTNN HGPFSTDYLG KNYDYPEATY ARRKQIIKDH ELYQKGLMYF LSNDPRVPAD
VHQAMQQWGL AKDEFTDNGN WPHQLYIREA RRMLGVFVMK EADALGKTTV PQPIGMGSYS
LDAHNAQRYV KPDGFVQNEG DIGVHPDKPY SIAYGSILPK ETECNNLLVP VCVSSSHIAY
GSIRMEPVFM ILGQSAATAA VLSIDNKVSP QRLPYETLSR VLLNDKQRLT LNAAN
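
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The following is a minimal sketch, not the tool that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons). The `translate` helper and the variable names are hypothetical, and only the first and last lines of the gene sequence are used, both of which begin on a codon boundary.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGAACATTT TTGAATTCCT GATACCCCGA TTAGCCAGCA CGCTACCCCG GCGTTCACTC"
last_line = "GTGTTGTTAA ATGATAAACA GCGGTTAACG CTGAATGCAG CAAACTAA"

print(translate(first_line))
print(translate(last_line))
```

The two printed strings can be compared with the start and end of the protein sequence listed above. Passing the full gene sequence to the same helper would allow a complete comparison.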