Gene Slin_4333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4333 
Symbol 
ID8728093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5253154 
End bp5255430 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content56% 
IMG OID 
ProductFG-GAP repeat protein 
Protein accessionYP_003389114 
Protein GI284039184 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00966426 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.30257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC TGTTTACGCT TTGTTTTTTA GCAATTCCAT TTCTCACCCA TGCGCAGTCG 
GGCGGGAAGA TCGCGTTTCG ATACGACCAG AGCCCAACCG TAAGCATTAA TAACCGCCCG
CTGCTCAATC CGTGGGTGGG TGGCCTGAAC ACGACGCAGT ATTCAACCAT TCGCCTGAAC
AACGACACCC GCGACGATCT GGCCGTTTAC GACCGGACAA CCGGTAAGGT CAGCACGTTT
ATCGCCGTCG ACAATCCCAT TGGCAGCGGC ACGGTATGGC AGTACGCGCC CGAGTACGAA
CTAGCTTTCC CGGCGGTCAT GTACAGCTGG ATGCTGCTGG TCGATTATGA CTTCGACGGC
CGGAAAGACA TTTTCACCAA CAGTTCGAAG GGCATCAGCG TATGGCATAA CGAATCGCAG
AACGGGACGG TATCGTTCAA ACTGGCAGTC GATCCGCTCC GAACGCTTGG CTTCAGTGGC
TTTCAGATCC CCCTTTACGT CAATGCGTCC GATCTGCCCG CTATTATGGA TTACGACGAC
GATGGCGACA TCGACATTAT CACCTTCGAT GCCGATGGCA ACATCATTGC CTACCAGCAA
AACATGAGTA TGGAGCGAAC CGGCACGAGA GGACTCGACT TTGCCCGCGC CGGTAATCAG
TGCTGGGGGC ATTTTCAAAA AGAGTTTTGC AACGACTTCA GATTTGGGAT CAATTGCGAC
GATGGCTCCG GCACCGGTGG GCGTCTGGCC GCTCCAACTG CTACCGGTCG AGTCAGCCCG
ACTAGTCCCA GCGGGGCAAG ACCGCTCCAT TCGGGTAACA CCCTCACGGT GATCGATACT
GATGGCGATG GTAAGAAAGA CTTGCTGTTC GGATTTGTAA GCTGTGAAAA CATTGCCCGC
CTGCGAAACA CCGGGCCCAA TAGCGTAAGT GCTAACTTCA CGAGTTACGA TAGCCTGTTT
CCCGCCCGAA ACCCCATTCT GTTTCCGGCC TTTCCGGCCA CTTTTTTTGA GGATGTCGAC
GGCGATGGGC AGAAAGATTT GCTCGCATCG CCCAATGTAA ACTTCAACGA CGGCAATGTC
TACGATTTTC GGGCGTCGGG CTGGTTTTAC AAGAATACGG GTACTACGCA AAAGCCCGAT
TTTCAGCTCA TTCAGAAAGA TTTCCTGCAA AGTGACATGC TCGACCTGGG CGAACGCACG
GCCCCCGCGC TGGCCGATCT GGATGGCGAC GGCGATATGG ATTTACTGGT CGGGTACGGC
GGGGTAGGGG TTGGCTCCGG CTACCGGGGA GGCATCTGGC AATTCGAGAA TAAGGGGACA
ACGCAAAATC CGGCTTTCGT GCTCGTCACA ACCGATTACC TGGGTATACA GTCGCTGGGG
CTGACAAACG TGGTGCCTTC TTTTGCCGAT GTGGATGCCA ATGGTAGTAT GGACCTGATC
GTCACCGCCA CGGGGAAACA GGCCGTAGAA ATTCGCGCGC TGATCAATAC CGCTCCCAAA
GGAGCTGCCG TCCAATACAG CCTTGCCAGT GCCACCCGCT GGCCCACGCC CGATCTGATG
TACCCGCTCG ATCTGCTGAC GGTTACGGAT GTCGACAAAG ACAACAAACC GGACTTACTG
ATTAGCCGGT ACAACGTGGG TACCATTCTC TATTACCGCA ATGCTGGCAC GGCCACGGCT
CCCGTTTTCC AGTTGCAGAA CCAGACGTTC GGTGGGATCA CGACGGACGA TTACATCTAC
GCCCGCGCCC GGTCGCTGGT GGTGGCCGAC CTGAACGGCG ATCAGAAAAA CGAACTCATT
GCCGCAGCCG ATAACGGTTC GGTTAAGGTC TATCAATTTC CCGAAACCCC GACTCAGGCG
TTTACGCTGA TCGACTCGCT GGCGGGCATA GGCTTGCCCG GCAAAGGACT TATTGCCGCA
GCCGCCGACC TCGACGGCGA CCAACTGCCC GATCTGTTGC TTGGTGGTAC GGGTGGCGGA
TTGCGCTATC TGAAAAACAC CTCCCAAAAG ATCGTTGTGA CCGGCCTACC CGAAGAACCG
ACCGGCCCAT GGGTGTTCCC CAACCCCACT AACCGGTACA TTACCGTTCG TCCGCACTAC
GATGGGCGCG TTGAACTGGT GTCTTTAACG GGCCAGACCG TGGTGCCCGT TCAGCCGGTT
AAAGCCGGGA CAGAAAGTCT CCTTGATTTA GGTGAGCTGG CCGATGGAAC GTATCTGATC
CGACTCCAGA GCGATAACCG CCCGGTACAA ATTCAGAAAG TGGTGGTCTG GAAATAA
 
Protein sequence
MKKLFTLCFL AIPFLTHAQS GGKIAFRYDQ SPTVSINNRP LLNPWVGGLN TTQYSTIRLN 
NDTRDDLAVY DRTTGKVSTF IAVDNPIGSG TVWQYAPEYE LAFPAVMYSW MLLVDYDFDG
RKDIFTNSSK GISVWHNESQ NGTVSFKLAV DPLRTLGFSG FQIPLYVNAS DLPAIMDYDD
DGDIDIITFD ADGNIIAYQQ NMSMERTGTR GLDFARAGNQ CWGHFQKEFC NDFRFGINCD
DGSGTGGRLA APTATGRVSP TSPSGARPLH SGNTLTVIDT DGDGKKDLLF GFVSCENIAR
LRNTGPNSVS ANFTSYDSLF PARNPILFPA FPATFFEDVD GDGQKDLLAS PNVNFNDGNV
YDFRASGWFY KNTGTTQKPD FQLIQKDFLQ SDMLDLGERT APALADLDGD GDMDLLVGYG
GVGVGSGYRG GIWQFENKGT TQNPAFVLVT TDYLGIQSLG LTNVVPSFAD VDANGSMDLI
VTATGKQAVE IRALINTAPK GAAVQYSLAS ATRWPTPDLM YPLDLLTVTD VDKDNKPDLL
ISRYNVGTIL YYRNAGTATA PVFQLQNQTF GGITTDDYIY ARARSLVVAD LNGDQKNELI
AAADNGSVKV YQFPETPTQA FTLIDSLAGI GLPGKGLIAA AADLDGDQLP DLLLGGTGGG
LRYLKNTSQK IVVTGLPEEP TGPWVFPNPT NRYITVRPHY DGRVELVSLT GQTVVPVQPV
KAGTESLLDL GELADGTYLI RLQSDNRPVQ IQKVVVWK