Gene Slin_0335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0335 
Symbol 
ID8724063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp427440 
End bp430394 
Gene Length2955 bp 
Protein Length984 aa 
Translation table11 
GC content55% 
IMG OID 
ProductPKD domain containing protein 
Protein accessionYP_003385198 
Protein GI284035268 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.16291 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.953666 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAAAAA CGATAATTAA CACCGGCAAA CGGACCACCC GTTTGCCGGG TCTGTTCAAC 
CTGTGCGTTA ACGCAGGTCT GTTGCTGGGA CTGGCTTCGT TTGTGATGCA AGACGGCTCC
ACCAAACCCG ACGAAACCCG GTTTACGCCA GTTGTGCTGG CCGAAGACCT CGATGAGCCG
ATGGTGTTTG AGGTTGCCAA AGACGGAACG GCCTTCATTA TCGAGCGCAA AGGAGCCTTG
AAAAAATATG ATCCGGTCAC GAAAACGGTC GATCTGATCG CGACCATCCC GGTCAATACG
AAATACACCT CGGCACAGGG GCGCGTGTCG GAAGCTGAAG AAGGGCTTCT GGGCCTGTCG
CTGGACCCCA ATTTCGAGCA GAATCACTGG ATGTATCTGT ATTACGCGCA CCCAACCGAG
AAGAAACACA TCCTGACGCG CTGGGAGTAC CGCAACGAAA AACTGGTCGA GAACTCGCAG
AAAGTGATGC TGGAAGTAAC AACCCAGCGT GAAGTGTGCT GCCATACGGG TGGTGGAATG
ACCTGGGATC GGGCGGGTAA TCTGTACCTC ACCGTAGGTA ACAACACCGG AAACCAGCAG
GCTGCCCAGA CCGACGAACG CCCCGACCGC AGTAGTTGGG ACGATCAGGG CCATGCCGGC
AACACCAACG ACTTACGGGG AAAAATCCTC CGGATTCATC CCGAAGCCGA TGGTACGTAC
TCCATTCCGG AAGGAAACCT GTTCCCAAAA GGCACCGAAC CGTCGGACCG CGCCAAAACC
CGCCCCGAAA TTTACTCTAT GGGGCATCGT AATGCCTGGC GTATTTCAAT CGACAGCCAG
ACGGGTTATG TGTACTGGGG CGAAATCGGT CCCGATGCGA CCAAGGATTC TGAAATTGGT
CCGCGTGGTT ACGATGAACT AAACCAAGCC CGCAAGCCGG GCAATTTTGG TTGGCCGTGG
TTCGTAGGAA ACAATCAGGC ATTTCCGGTG TATGATTACG CCAACAAGAA GCCACTGGAG
AAAAAAGATC CGAAAAACCC GGTTAACAAT TCGCCTAATA ATACCGGGCT GACGAATCTG
CCGCCAACGG CTCCTTCGTT TATTTATTAC CCATACGCGG TTTCGGAAGA GTTTCCACTG
GTGGGAACGG GTTCACGGTC GGCAACGGGC GGGCCGGTTT ATCGCCAGGC CGACTTTAAA
GGGGCCAAGC GGCCCTGGCC AGCGTATTAC GAAGGGAAAT GGCTCGTTAC TGACTTTTCC
CGAGGCTGGA TTATGGCCGT TTCTATGGAT GCCGAGGGGA ATTACAAGGG TATGGAGCGG
GTTCTGCCTA CTTATCACCC GGTGGAGCCC ATCGACATGA AGTTTGGCCC CGACGGCGAC
CTGTACGTAC TCGAATACGG TAGCAACTGG TTCCGAAAAA GCGACAATGC CCGCCTTGTC
CGGATTGAGT ACAACAGTGG CAACCGGAAG CCAATTGTGC AGGCATCGGT AGCGACGAAT
CAAGGCAGTA AGTCGGGCGG TACGTTGCCG CTTCAGGTCA CATTATCGGC CGATGGGAGC
AAGGATAATG ATGGCGATGC GCTCAGCTAC CAGTGGAAAG TGACCTCGCC GGGTATTGCC
CCAAAGGTGT TTACGACTGC TAATCCCACC GTTACCTTCG ACAAAGCCGG TGTTTACACC
GCCACGCTGA CGGTCACGGA TGCGCATGGT GCCGCTAACA GCCAGTCGGT ACGCATCATT
GCCGGAAACG AAGCACCCGT TGTCGCGGTG AATCTGACAG GAAATAAAAC CTTCTTTTTC
CCCGATCAGC CCATTCAGTA CGCAGTCAAT GTGTCGGATA GGGAAGATGG TACCCTGGCT
AAATCAACCG CTGCTCCGGG CCAGATAAGT CCGGCTCGGG TGGCTATGAG CATCGACTAC
ACCTCCGAAG GCTTCGATTA TGCCGAAGTA ATGCAGGGGC AGCGGAGCGT CGATGCGTCT
ACGCAATATG CTGTGGCACA GGCGCTGATT AGCCAGAGCG ATTGTAAAGT ATGCCACCAG
ATCGACACCA AGTCGGTGGG GCCTGCCTTC ACTGCCGTTG CCGCTAAATA CAAGGGAGAC
ACCGGAGCAC CGGCGCGACT CGTCAGCAAA ATCCGACAGG GTGGTGTAGG CGTCTGGGGC
GATGTTGCCA TGCCTGGTCA CCCGGCCATG TCGGTTGCCG ATGCGGGTAT TCTGGTGAAC
TATATTCTGC ATATCAACGA AAAAACGCTC AGCAGCTTAC CCATGGAGGG TACCTATACC
CTGAAAATTC CGGAAGGCGA CAAAGGCAAC GGGTCCGTAC TGATTCGGGC GGCTTACACC
GACCGGGGCA AGGCAGCCGC TAAAGGCGGC AAGCCTGTAC CGGCCCAGAC CAGTGAGCAG
TTACTGGTAC TGCGCAGCCC ACAACTCGAT GCGTCTACCG CTGCGATTAT TCGTGGTGCA
GAGGTAAAAG CTAAAGGTAT GGGCAAAGGG GAGAATGTGA TTCCATACGC AAACAGCTAC
ATCGGTTTCC GGAAGCTGGA TCTAACGGGT ATCAAGCAGC TTGAACTGAC GGCGTCGGCA
CAGCGACGCG AAGGAAGTTC GGGCGGTACC ATCGAGGTTC ACCTCGACTC GCCCACCGGC
CCCCTCGCTG GTGAAACAGT TGTTGAACTG GCACCGGAAG TGGACATGGA AAAGCTGATG
GCTCAGCTGG AAGCCGGACC GAAACCACCC GCCGGTGGTG CCAATGGTTC GGCAGCAACA
CCCGGTGGCC CTGCTGCTCC CGGCGAACCT GCTAAACCCC GGCCCAATCC ATTTGCTCGG
CCCCCTGTGT ACCTGACGCT GAAAAATGCG GAGGGTGTTC ACGACGTGTA TTTTGTCTTT
AAAAACGATC AGGCAAAAAA CATACAGCCG TTAATGTCGC TGTCGAGCAT TAAGTTTATG
AATAAAGAGA AGTAA
 
Protein sequence
MRKTIINTGK RTTRLPGLFN LCVNAGLLLG LASFVMQDGS TKPDETRFTP VVLAEDLDEP 
MVFEVAKDGT AFIIERKGAL KKYDPVTKTV DLIATIPVNT KYTSAQGRVS EAEEGLLGLS
LDPNFEQNHW MYLYYAHPTE KKHILTRWEY RNEKLVENSQ KVMLEVTTQR EVCCHTGGGM
TWDRAGNLYL TVGNNTGNQQ AAQTDERPDR SSWDDQGHAG NTNDLRGKIL RIHPEADGTY
SIPEGNLFPK GTEPSDRAKT RPEIYSMGHR NAWRISIDSQ TGYVYWGEIG PDATKDSEIG
PRGYDELNQA RKPGNFGWPW FVGNNQAFPV YDYANKKPLE KKDPKNPVNN SPNNTGLTNL
PPTAPSFIYY PYAVSEEFPL VGTGSRSATG GPVYRQADFK GAKRPWPAYY EGKWLVTDFS
RGWIMAVSMD AEGNYKGMER VLPTYHPVEP IDMKFGPDGD LYVLEYGSNW FRKSDNARLV
RIEYNSGNRK PIVQASVATN QGSKSGGTLP LQVTLSADGS KDNDGDALSY QWKVTSPGIA
PKVFTTANPT VTFDKAGVYT ATLTVTDAHG AANSQSVRII AGNEAPVVAV NLTGNKTFFF
PDQPIQYAVN VSDREDGTLA KSTAAPGQIS PARVAMSIDY TSEGFDYAEV MQGQRSVDAS
TQYAVAQALI SQSDCKVCHQ IDTKSVGPAF TAVAAKYKGD TGAPARLVSK IRQGGVGVWG
DVAMPGHPAM SVADAGILVN YILHINEKTL SSLPMEGTYT LKIPEGDKGN GSVLIRAAYT
DRGKAAAKGG KPVPAQTSEQ LLVLRSPQLD ASTAAIIRGA EVKAKGMGKG ENVIPYANSY
IGFRKLDLTG IKQLELTASA QRREGSSGGT IEVHLDSPTG PLAGETVVEL APEVDMEKLM
AQLEAGPKPP AGGANGSAAT PGGPAAPGEP AKPRPNPFAR PPVYLTLKNA EGVHDVYFVF
KNDQAKNIQP LMSLSSIKFM NKEK