Gene Slin_5004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5004 
Symbol 
ID8728768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6095780 
End bp6098812 
Gene Length3033 bp 
Protein Length1010 aa 
Translation table11 
GC content55% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003389780 
Protein GI284039850 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGAA ATACAGTAAG CCGTTTTGTA TTGCTGGTAC TGATCGTTTG TTTCTCCGGA 
TTCGTGGCAG ACCCGCTGGA GGGTGCATGG GACTGTTCGA TCAATCTGGA CGGGAAACTG
GTCGGAACCC TGAAGGTAAG CCGGGTGAAT GCGTCGTATA CCGGCGTGTT CTTCTCCTTT
GAACGGGGAG AAATACCCGT CCACCATGTA ATGGTTGATG GAAACAAACT GTCCGGGGAT
TTTACGCTGA GGAAAGAAGT ATACCAGTTG GTGGCTGAGG CAACGGGAAA TACGCTGACG
GGCAGTGTTG TGTCGGGTGC CAAACGGTTT ACGTTGCTGG CTAATCGAAT CACCGAACCC
GCACAGGCGT ATGTGCCTCC TCGGGTCTCT TACATTCTGG CGGATAAGGA ACTGAATGGG
TTTGAGCAGG ATATTGACCA TGCGGGTCTG ATTGAACGGT TCGACCGGGA GGGTTATACA
CGGGGCGAGC GGATTTACAA TGCGGTTTGT ATCAATTGCC ACGGCAATCC GGAGTCGGAA
GGGTCGATTC CGATGTCGCA CAAGTTCTGG AAGCAGCCGT TCAAGGCCGG TTACGACCCG
TTTTCAATGT ACCAGACCAT CACCAAAGGG TACGGGTCTA TGCCGCCCCA GATGACGCTC
AGCCCGCAGG AGAAGTATGA CGTAATTTCG TATATCCGGG AGAATTTCGT CAAGGAAAGT
AACTCCAGCC AATACCTCCG CGTAACGCCC GGTTACCTCG CCAGTCTGCC GAAGGGAACG
TCAAGAGGCC CGGCGGTAAA ACCCTATCAT CCCTGGTCGG ACATGGACTA TGGGAGTTTC
TTTATCAATA CGTACGAACT GGCCGATTCG GCTACAGGCC CGAAACGGTT CCATTCGCCG
GGACGCCCTC CATATGCTGA CGAAGATTAT CGGAAAAATA ATTTTGCCTA TAAAGGCATT
GCCGTTCGGG TCGATACCGG GGCTGGAGGT GTGTCGGCCG GGAAGGCCTG GATGATGTTT
GACCACGACG TCATGCGCGT AGCCGGTGGC TGGACGGGCA GGGGATTTAT TGACTGGGAG
GGAATTCTGC TCAACGACAA CCACGAAACC TACCCCCGAA CCATTGGCCA GCTACATTTC
GAAACGCCCG TTGGCCCCGG CTGGGCAAAC CCCGTTACCG GCTCGTTCAA CGACCCCCGA
TTTACGGCGC GGGACGGTCG TCAGTTTGGG CCGTTGCCCA AAACGTGGGC CAACTACAAA
GGGCTGTATC ATCATGGAAA CACGGTAATT ATTGCCTACA CCGTCGGCCA GGCGGCTATT
CTGGAAAAAT TGGGGATGGA AACGGTTGAG GGCCAACCGG TTTTTACACG AACCCTGACC
ATTAATCCGG GTCAGCAACT GCTGAAAACC AGAGTGGCGC GGGTCGGCAC CAACGTCGCG
GTCGTTGGGA AAGGAGCATC GCTGACGGAG GAAAACGGAT ACGTTGTTTT GCAGATTACG
GCCTCGGCAA AAACCACCAT CAAACTGTTG ATGGCCCGGC CCGGTAGCAA AAGCCTGACC
GCTCTGGCGG CCCGTTCACC AACGCCCGAA TCGCTCGAAA AATACACCAA AGGCGGGAAG
GCGCATTACC CACAAACGCT GTACACGACC ATTACGCGGG GAAAAGACGA CGGCTTGTTT
GCGGTGGATC AACTGTCGCC ACCCTACGAC AATCCGTGGA AATCGCGGAT GAAGCTCAGC
GGTATCGACT TCATGAAGAA CCCCAATGTG GCGGTTTTAT GCACCACCGA TGGCGACGTG
TGGCGGGTGA CGGGCCTGAC CGATGCCAGC GGGCGTCTTG GCTGGAAGCG CATTGCTTCG
GGGCTTTTTC AGCCGCTGGG TATCAAAGTG CTGAACGAAA AAATATACGT TACCTGCCGC
GACCAGCTGG TGCTACTTCG TGATCTGAAC GGCGACGGCG AAACCGATTT TTACGAAAGC
TTCAACCACG ACCATCAGGT GACGGATCAT TTTCACGAAT TCGCGATGGG CTTGCAGGCT
GATAAAGAAG GCAATCTGTA TTATGCCAAA AGTGGTCGTC ATGCCCGCGA AGCCCTGATT
CCCCAGCACG GAACCCTGCT GAAAGTCAGC AAGGATGGGG CCACCACGCA GATCATGGCT
ACCGGGTTCC GAGCTGCCAA CGGCGTGTGC ATCAACCCCG ACGGTAGTTT TATCGTGACC
GACCAGCAGG GCTACTGGAA CCCTATGAAC CGCGTCAACT GGATTGAGGG CAAGGGGAGG
TTCTACGGTA ATATGTGGGG CTACAACCCG CCCAACGACT CCACCAAAGC CGCTATGGAG
CAGCCGCTGG TTTGGATTGA TATGGAGTTC GACCGGTCGC CGTCGGAGTT GCTGTGGGTG
GACAGTAAGA AATGGGGGCC ACTGAACGGC AGCCTGCTCA GTTTTTCGTA CGGCTATGGC
AAGATTCAAC TCGTGCTACC CGAAGACGTG AATGGCCAGA AGCAGGGTGG CGTGATCGAC
CTGCCGGGGG TTAAATTCCG AACCGGCATC ATGCGCGGTC GCTTCAACCC CGCCGATGGG
CAGCTATACG CCTGTGGCAT GTCGGCCTGG GGAACGTCGC AAACGATTAA AGGTGGCGAT
TTCTATCGGC TTCGCTATAC CGGAAAATCC CTCCCGCTCC CCACCCGGCT GAGTGCCGAA
ACGGATGGTG TTACATTGAC ATTTGCTCGC GAGCTGGCCG AAAACAGCAC CCGGAATCCT
GCCAGTTTTG TGGTGAAAAG CTGGGACATT CGGAGGAGCC GGAAGTATGG TTCGGACCGC
TACAATATAC AAACATTGAC CGTCGCGGAT GTGAAACTAA GCCGGGATAA ACGAACCGTA
AAACTCCTAC TTCCCGGCAT AAAACCCGTC GATGTCATGA CGATTGCCTA CACCGTTGCC
GATACGAAAG GGCAGACGTT TACGGGCACC GTACAGAACA CGATTCATGC ATTACGAAAA
GCCAGTAATC AGGTTTTTAC TGCCGTCAGG TAA
 
Protein sequence
MTRNTVSRFV LLVLIVCFSG FVADPLEGAW DCSINLDGKL VGTLKVSRVN ASYTGVFFSF 
ERGEIPVHHV MVDGNKLSGD FTLRKEVYQL VAEATGNTLT GSVVSGAKRF TLLANRITEP
AQAYVPPRVS YILADKELNG FEQDIDHAGL IERFDREGYT RGERIYNAVC INCHGNPESE
GSIPMSHKFW KQPFKAGYDP FSMYQTITKG YGSMPPQMTL SPQEKYDVIS YIRENFVKES
NSSQYLRVTP GYLASLPKGT SRGPAVKPYH PWSDMDYGSF FINTYELADS ATGPKRFHSP
GRPPYADEDY RKNNFAYKGI AVRVDTGAGG VSAGKAWMMF DHDVMRVAGG WTGRGFIDWE
GILLNDNHET YPRTIGQLHF ETPVGPGWAN PVTGSFNDPR FTARDGRQFG PLPKTWANYK
GLYHHGNTVI IAYTVGQAAI LEKLGMETVE GQPVFTRTLT INPGQQLLKT RVARVGTNVA
VVGKGASLTE ENGYVVLQIT ASAKTTIKLL MARPGSKSLT ALAARSPTPE SLEKYTKGGK
AHYPQTLYTT ITRGKDDGLF AVDQLSPPYD NPWKSRMKLS GIDFMKNPNV AVLCTTDGDV
WRVTGLTDAS GRLGWKRIAS GLFQPLGIKV LNEKIYVTCR DQLVLLRDLN GDGETDFYES
FNHDHQVTDH FHEFAMGLQA DKEGNLYYAK SGRHAREALI PQHGTLLKVS KDGATTQIMA
TGFRAANGVC INPDGSFIVT DQQGYWNPMN RVNWIEGKGR FYGNMWGYNP PNDSTKAAME
QPLVWIDMEF DRSPSELLWV DSKKWGPLNG SLLSFSYGYG KIQLVLPEDV NGQKQGGVID
LPGVKFRTGI MRGRFNPADG QLYACGMSAW GTSQTIKGGD FYRLRYTGKS LPLPTRLSAE
TDGVTLTFAR ELAENSTRNP ASFVVKSWDI RRSRKYGSDR YNIQTLTVAD VKLSRDKRTV
KLLLPGIKPV DVMTIAYTVA DTKGQTFTGT VQNTIHALRK ASNQVFTAVR