Gene Slin_4614 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4614 
Symbol 
ID8728378 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5601522 
End bp5603927 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content53% 
IMG OID 
Productprotein of unknown function DUF214 
Protein accessionYP_003389391 
Protein GI284039461 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.122129 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGAA ACTATTTTCT TATCGCCTTT CGCAATTTGC GGAAACATAA ATCATTCAGT 
TTCATCAACA TCACGGGGGT AGCTGTGGGA TTGGCGTGCT TCCTGCTCAT TGCCCTCTAT
GTGCAGGATG AACTGAGCTA CGACCGCTAC AACACCCATG CAGATCGGAC GTATCGCCTT
ACACGCACGT TCCTCTCGTC GGAAGGCACG GCTTCCCTGC GGCTGGCACA GGCGGCACCA
CCCTTCGGCC CGCTCATCAA GCAGGATTTT CCGGAAGCCG AGCAGGTTGT GCGAACGATT
GATAACGGAG GGCTGGTAAA ATACGGCGAG CACTCGTTCA ACGAGGAGGA TATGTTTTTT
GCCGAAGCGA ATCTGTTCAA GGTGTTCGAT TTTAACGTCA CCAGCGGAAA CCCCGAACAC
GCGCTGGTGA ATCCGTTTTC GATCATGTTC TCCCGGCCGA TGGCGGAGAA GTACTTTGGC
CGGGAGAACC CGGTCGGTAA AACCGTTCGC CTGTATGACC AGTTCGATCT GACCGTAACG
GGTGTGTTTG AGCCTTTGCC CGCTCAGTCG CATTTTCATC CCAGCTTCCT AGTGTCGTTT
TCGACCTTCA ACGACAACCG CGTTTACGGC GCAGAACAGC TTCGGACCAA CTGGAGCAAC
AACTCATTCA ACACCTATGT GCTGCTAAAG CCCAACGGTA ATCCACAGCG AATGGAAGCT
GCGTTTCCGA GCTTTCAGGA CAAATATGTT CCGGCCGAGG AAGGGCGTAA AGCATCCGCT
TTTTCAATAC TGAACCTCCA AAAACTCACC GATATTCACC TGAAGTCGCA TACCGATTCG
GAGATAGAAC CCACCGGCGA CATGAGTTAT ATCTATCTGT TTTCGGCCAT TGGTTTATTC
ATTCTGCTCA TTGCCTGCAT CAATTACATG AATCTGGCAA CTGCCCGGTC GGCAGGGCGG
GCCAAAGAGG TTGGGATGCG CAAGGTTGTA GGGGCTTTGC GTTCCCAACT CATTGGGCAG
TTTTTGAGCG AGTCGATTTT AGTGGTAACC TTCTCTTTAT TCATTGCCAT CGGGCTGGTG
CTACTTTGTT TGCCCGTGCT GAACGAGTTT ACGCAAAAAC ATATGGCGTT TAGCCAATTG
CTTGACCCTG TCTTTTTGAG TGTCCTCATT GGCATTACCT TACTCACCGG CCTAGTGGCA
GGTAGTTACC CCGCCTTCTT CCTCACCTCC TTCCGGCCAT TAGGCGTGCT GAAAGGACAG
ATTGCATCGA CCATGCGGAC GGGCAAACTA CGGCAGGTGC TGGTGATCAC GCAGTTCGCT
ATTGCCATCG CGCTCATTAT CAGCACGGCC GTCGTCTATA ACCAGATGAA ATACATTCAG
AATTACCGGC TGGGCTACGC GAAAGATCAG GTACTTCTCC TGTCAGACAT TGGTGACTCA
ACAACGAATT ACGAAACCCT AAAACAGCAA CTCCTGCAAA CGGGTGCCGT ACGCGACATG
GGCCGTTCCT CGCGCGTACC GTCGGGCAGA CTGCTCGATT CATACGGCGG CATGGCTATG
AAAGGCGACA GCATGGCCCC GGTAAAAATC AACTTACGGG GACTGCGCGT CGATTACGAT
TTCATCCCGG CTTACCAGAT CAGCATGGCC GCTGGGCGCA ACTTCTCGCG GGCCTACTCC
ACCGACACGT CGATGGTGGT GCTGAACGAA ACAGCCGTGC GTCAGTTGGG CTGGACACCC
GAACAGGCCA TCGGCAAACC GTTTCAGTAT GGCCCCGCCA AAGGCCAGAT CATCGGCGTA
ACGAAGGATT ACCACTTTGA ATCCCTGCAT CAGCAGGTGG CTGCACTGGC CATGATTCTG
ACACCCCGTC AGCTTAACTG GATTTCCATC CCACTCAAAG GAAATATTAC GGCGAGTATT
CAGCAGGTCG AATCGGTCTG GAAGCAATAC TTCCCGCAAC GCCCGTTCGA CTACCAGTTT
CTGGATACCC GCTTTGACCG GCTCTACGCC CGCGAGCAAA CGCAGCAAAC GTTGTTCAGC
ATCTTTGCGG GAGTGGCTAT CCTTATTTCG TGCCTCGGTT TGTTTGGCCT GTCGATGTTC
ATGGCGGAAC AGCGTACCAA GGAAATCGGT ATTCGCAAAG TGCTGGGTGC GTCGGAAGCG
AGTCTGGTAG CCTTGTTCTC TCAGGACTTC ATGAAACTGG TACTGGTAGC ATTGGTCATT
GCATCGCCCA TCGCGTGGTA CGCCATGCAC ACCTGGCTCA GCGACTTCGC CTACCGCACC
GACATCCACT GGTGGGTATT CCTGCTGGCC GGTGGCCTGA CGATTTTCAT AGCCTTATTA
ACCGTAAGTT TTCAAAGCGT GAAAGCCGCC TTGATGAACC CGGTAAAATC ATTACGGTCG
GAATAG
 
Protein sequence
MLRNYFLIAF RNLRKHKSFS FINITGVAVG LACFLLIALY VQDELSYDRY NTHADRTYRL 
TRTFLSSEGT ASLRLAQAAP PFGPLIKQDF PEAEQVVRTI DNGGLVKYGE HSFNEEDMFF
AEANLFKVFD FNVTSGNPEH ALVNPFSIMF SRPMAEKYFG RENPVGKTVR LYDQFDLTVT
GVFEPLPAQS HFHPSFLVSF STFNDNRVYG AEQLRTNWSN NSFNTYVLLK PNGNPQRMEA
AFPSFQDKYV PAEEGRKASA FSILNLQKLT DIHLKSHTDS EIEPTGDMSY IYLFSAIGLF
ILLIACINYM NLATARSAGR AKEVGMRKVV GALRSQLIGQ FLSESILVVT FSLFIAIGLV
LLCLPVLNEF TQKHMAFSQL LDPVFLSVLI GITLLTGLVA GSYPAFFLTS FRPLGVLKGQ
IASTMRTGKL RQVLVITQFA IAIALIISTA VVYNQMKYIQ NYRLGYAKDQ VLLLSDIGDS
TTNYETLKQQ LLQTGAVRDM GRSSRVPSGR LLDSYGGMAM KGDSMAPVKI NLRGLRVDYD
FIPAYQISMA AGRNFSRAYS TDTSMVVLNE TAVRQLGWTP EQAIGKPFQY GPAKGQIIGV
TKDYHFESLH QQVAALAMIL TPRQLNWISI PLKGNITASI QQVESVWKQY FPQRPFDYQF
LDTRFDRLYA REQTQQTLFS IFAGVAILIS CLGLFGLSMF MAEQRTKEIG IRKVLGASEA
SLVALFSQDF MKLVLVALVI ASPIAWYAMH TWLSDFAYRT DIHWWVFLLA GGLTIFIALL
TVSFQSVKAA LMNPVKSLRS E