Gene Slin_5235 details


Gene Information

Locus tag: Slin_5235
Symbol:
ID: 8729001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6390051
End bp: 6391328
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: protein of unknown function DUF349
Protein accession: YP_003390005
Protein GI: 284040075
COG category:
COG ID:
TIGRFAM ID:
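
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 6390051
end_bp = 6391328

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1278
print(protein_length)   # 425
```

Under these assumptions the computed values agree with the reported Gene Length (1278 bp) and Protein Length (425 aa).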


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.809819
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAAACG CTTCACTGGT AGATGAATAC GGTTACGTCA AAGACGGAAA GGTATTTTTG 
AAAGGCTACC TGAATTACGA AGACCGCCAG ATTGGCGAAG TCAAACGCAC CGAGCAGGAA
GCGCTCGATT ATTTCAAGAA TCGCTTCATC ATCGCGGAGA ACAAAGTCAG CCAGCTAGAA
AAAGATATTG AAGAGGCTCA GAACAAAGGC TCTTATCTAA CAAAACTGGT TCAGCTTCGT
AAGAAGCTCC TCGGTTTTGA CGCACTGGGC GATTTCCCTC CCCTGCTTGA GCGCCTGGAC
GAGCAGGAAA AACTGCTGGC TGACCTGATT ACGGTCAATC AGCTGAAAAA CCTTGAAATA
AAACGAGCCC TGATCGCCGA AGCTGAAGCT CTGGCCGATA GTACCGACTG GCGTAACACA
GCCGATGCCC TTCAGGAAAT CAAGGTGAAA TGGATCAAAA CGGGTCCTGT CGACAAAGCG
ACGGAGGCCG AAGTGGAAGG CCGGTTTCAG GAACTTCTGG ATGGCTTCTT TACCCGCCGA
CGTGAGTTCT TCAACGAACA GAACAAGGTT ATTCAGGAGC GGCTTGAAAA ATACGATGAA
CTGATTCGAC TTGCCTTCCG GGCCAACCGC CTGGGCGATC TGGATGCAGC TTTTCAGGAA
GTACGCAAAT TAAACAATGC CTGGAAACAG GTGGGTGAGG TGCCCATCAA AAAGAGCGGC
AAGCTCTACA AGCAGTTTAA GAAGGCGACG ACCATGTTCT ACGCCAAATA CAACGACGCG
AAGGGCATTG TAATCGTACC GAAGATCGAT CCTCGCATCG AGCAGCAGAT GAAAATGGCC
GACGAAGCCG AAAAGCTCTC GAAACAGTCT GATATTTTTG CCGCTGCCGA ACGGGCCAAA
GTGCTACTCA ATAGCTGGAA GGAAATTCGG GTGCCATTTA AGCTTCAGGA TAAAGTCGTT
AATGAACGCT TCCGGGCAGC CTGCGACAAG ATTTTCGAAC TTAGTTATCT GGGACGGGTA
TTGACGCGTA AATACCCGGC GTTTGAACTC AAAAGCCAGT CGGAACAGAT TCGGACGAAG
ATTCGGGAGA TGGAGTACCT GGTCAAGCGG GAGAAGAACG ACCTTCAGTT TGCTTTACAG
GACGCCGACG GTCTCGACCC GAACAAGGAC GAAGACAAGC AAATCCTGAA CAAGATCAAC
ACCCAGAAGC GCAAAATCGC CATGAAAGAG ACTATTCTTC GTGAGTTTCA GAAGCAACTG
GAAACAGCAG GCTATTAG
 
Protein sequence
MENASLVDEY GYVKDGKVFL KGYLNYEDRQ IGEVKRTEQE ALDYFKNRFI IAENKVSQLE 
KDIEEAQNKG SYLTKLVQLR KKLLGFDALG DFPPLLERLD EQEKLLADLI TVNQLKNLEI
KRALIAEAEA LADSTDWRNT ADALQEIKVK WIKTGPVDKA TEAEVEGRFQ ELLDGFFTRR
REFFNEQNKV IQERLEKYDE LIRLAFRANR LGDLDAAFQE VRKLNNAWKQ VGEVPIKKSG
KLYKQFKKAT TMFYAKYNDA KGIVIVPKID PRIEQQMKMA DEAEKLSKQS DIFAAAERAK
VLLNSWKEIR VPFKLQDKVV NERFRAACDK IFELSYLGRV LTRKYPAFEL KSQSEQIRTK
IREMEYLVKR EKNDLQFALQ DADGLDPNKD EDKQILNKIN TQKRKIAMKE TILREFQKQL
ETAGY
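
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two tables differ only in which codons may act as start codons), and that the spaces in the listing are only ten-character grouping. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and were chosen for this example.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Assumption: table 11 uses the
# same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks that group the sequence into blocks."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence listing, copied verbatim.
first_line = "ATGGAAAACG CTTCACTGGT AGATGAATAC GGTTACGTCA AAGACGGAAA GGTATTTTTG"
last_line = "GAAACAGCAG GCTATTAG"

print(translate(first_line))  # MENASLVDEYGYVKDGKVFL
print(translate(last_line))   # ETAGY
```

The two printed fragments correspond to the beginning and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the reported GC content.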