Gene Slin_5454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5454 
Symbol 
ID8729221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6634795 
End bp6637173 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content56% 
IMG OID 
Productconserved repeat domain protein 
Protein accessionYP_003390219 
Protein GI284040289 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.901976 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.718716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAAGT TTGTAAAATG GGCTGTTTTG CTCATTGGCT GGCCATTAAT GGCTATGGCT 
CAGCTTCAGA TTTCTCACCC GATGGCGCGG CTTGTCGTGC AGCGTGGAAC CGATGGTAAT
GGTCGGCTGT ATCTGTCAGG GCGTTTTACC GGAACCGTAG ACAAGGTTGA GGCTCAACTA
ACGCCCGCTG TGGCGGGTCA GGGGGTAGCT ACGGCCTGGC AAACGGTACA GACGTCGCCC
ACCAACAATC TCTTTCTGGG TTATGTCACG GCCGCCGGGG GCTGGTATGT GCTCACTGTG
CGTACGCTCG TGGGCAGCAC GGTAGTAGAA CAGGCCAGTG TACAGCCGGT GGGTATTGGC
GAAGTATTTA TTACGGCGGG GCAATCCAAT TCGCGTGGGC TGGGCATTGG CGATAATGAC
CTTGGCACCA ATACCGACCG GGTCAATGCT ATCGACTCCA TCAATCACTA CTACCCCCAG
CCGCCATCCT TGCCAGCCCT GGTCTCATCC GGCGATCCTA TGCCGGTGCC ACGGTATAAA
GCCCTGACGG CCGCACGGCG AATTTTCCCG ATGGCCGAAA GCTCCTGGGG CTGGGGCGAA
TTAGGAGATT ATATTGTCAA CCGGTTCAAC GTGCCCGTTG CGTTCTACGT AGCTGGCTGG
GATGCATCGA CCATCGATAA CTGGTACAAA ACGGCGAATG GCATTGCAAC CTGTAACGCC
TATTACTGTG TTGGTGGCGA CTGGCCCAAC CTGCAACCGT ATACGAACCT GAAAAATGTA
CTTCGTTATT ATGGGGCGGT AGCCGGCGTA CGGGCCGTGT TGTGGCATCA GGGTGAGGCC
GAAGGAGATA TTGCGGCATC GAGTATTCCA AATTATGCTA ATCTGCTTAA AGCCGTTATT
GCTAAGTCGC GGGCCGATTT TAACGGGTGG AGCCTGCCCT GGATGGTGGC CAGAGCGTCG
TTTAACGGGC GCATTACCAA CTCTGATGTT GTGGCTCAAC AGCAGGCCGT AATCGATACC
CCCGGTTTCA ATGTATTTCA GGGGCCGCTC AACGATACCA TTCAAAACCG GGCTGCGAAC
ACGGTTGATG TGCACTTTAG GAATGCGTCG CGCCTGTCGC CCCATCCGCA GTATTACCTG
AACAAACCCA TCCCGACCGA TATGGGCCTG TCGAGGTTTG CACGTAACTG GAACAACAGC
CTTAGTAATT CATTTTTTCA GAATGCGCAG CCCATTACCC CAACGCAATT CGCCGTGACG
GGTAACGTGG CGGCTTACGT ACCGCCCGGT GCAACGATCC CGGTGTCGTT TTCTACGTTG
GGTACCTTCA ATGCGGGGAA TCAATGGGAG GTGCAGTTGC TGGATTCTCT GGGGCAGTAC
ATTAGCGTGC TGGGTACCGG TACCGCTAGT CCAATCAGTG TAACATTACC CAATTCCCTG
CCGAGCGGCC GGTTTCAGCT TCGGGTGGTG GCTTCCTCGC CCGCCATGCC CGCCGTACCT
TCCAATCTTT TCCTGCTGAC AAACAAGGCC GATGTAAGTC TGGCAATGGG AATCAACCAG
CGAACGCCGG AGGTGAACAA GCCAGTGATC ATAAGTCTGG CGGTCCGAAA CGATGGACCG
GGTCTGGCCC GAAGTGTTGT CGTTCGCAAC CGGCTTCCTG CTAACCTGAC GTTTGTCTCG
TCGAGCGACT TTACGCTTAC CGGGGCTATC CTTACCGGTT CGGGCATCGA CGTTTTGCCA
GGTGCCACCC GAACGCTGAG CTTTACGGCC AAACCCACGC TGGCCGGTAT GTTTCAGAAC
GCAGCCGAGG TAGCGCAAAC CGCCAGCGTC GACACCGACA GTCAGCCCAA CTCCGGTACC
GGCGATGGCC AGGATGATGC CGCTCAGCTC GATTTTCGAA CTCGTCAGGT TAGTTCAGCC
GTTTTTACAT CCCCCAACCC GGACCAGGTG CCTTTGCCCG CCGTCAGCAG TAGTCAGCCC
ACTCCCGATC CGGCCAAGGC CGATGTCAGT ATTGCCCTTT CTGTCAATAA CCGGACACCT
GCGGTAGGTG ATGTCATTTC GTATACGTTA ACCGTAACGA ATCAGGGCGG ATTGACGGCA
ACCGGACTCA GTATGTCGGC CTATTTACCC GCCGGGCAGA CATTTGTTCC GGGTGATGAT
TTCATTCTGT CTGGTGGAAG TCCGGTTGTT GGCGTGAGCA GTTTAACAAT GGGAAGTAGT
CGTTCGCTTC TTTTTCGAGC CCGGGTTACC GCTGTAGGGC GGGGCGTTTG CACGGCTCAG
GTTGTAACCG CCAGCGTGCC TGACCCAGAC TCCGTACCCG GCAATGGCGT GACCAACGGC
GAAGACGATA CTGCTCAGGT CGATGTGCGC GTACGGTGA
 
Protein sequence
MLKFVKWAVL LIGWPLMAMA QLQISHPMAR LVVQRGTDGN GRLYLSGRFT GTVDKVEAQL 
TPAVAGQGVA TAWQTVQTSP TNNLFLGYVT AAGGWYVLTV RTLVGSTVVE QASVQPVGIG
EVFITAGQSN SRGLGIGDND LGTNTDRVNA IDSINHYYPQ PPSLPALVSS GDPMPVPRYK
ALTAARRIFP MAESSWGWGE LGDYIVNRFN VPVAFYVAGW DASTIDNWYK TANGIATCNA
YYCVGGDWPN LQPYTNLKNV LRYYGAVAGV RAVLWHQGEA EGDIAASSIP NYANLLKAVI
AKSRADFNGW SLPWMVARAS FNGRITNSDV VAQQQAVIDT PGFNVFQGPL NDTIQNRAAN
TVDVHFRNAS RLSPHPQYYL NKPIPTDMGL SRFARNWNNS LSNSFFQNAQ PITPTQFAVT
GNVAAYVPPG ATIPVSFSTL GTFNAGNQWE VQLLDSLGQY ISVLGTGTAS PISVTLPNSL
PSGRFQLRVV ASSPAMPAVP SNLFLLTNKA DVSLAMGINQ RTPEVNKPVI ISLAVRNDGP
GLARSVVVRN RLPANLTFVS SSDFTLTGAI LTGSGIDVLP GATRTLSFTA KPTLAGMFQN
AAEVAQTASV DTDSQPNSGT GDGQDDAAQL DFRTRQVSSA VFTSPNPDQV PLPAVSSSQP
TPDPAKADVS IALSVNNRTP AVGDVISYTL TVTNQGGLTA TGLSMSAYLP AGQTFVPGDD
FILSGGSPVV GVSSLTMGSS RSLLFRARVT AVGRGVCTAQ VVTASVPDPD SVPGNGVTNG
EDDTAQVDVR VR