Gene Slin_3489 details


Gene Information

Locus tag: Slin_3489
Symbol:
ID: 8727242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4222576
End bp: 4223682
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: Cytochrome-c peroxidase
Protein accession: YP_003388296
Protein GI: 284038366
COG category:
COG ID:
TIGRFAM ID:
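
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the usual convention for records like this, but not stated on the page) and that the CDS ends with a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4222576
end_bp = 4223682
gene_length_bp = 1107
protein_length_aa = 368


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


print(span_length(start_bp, end_bp))            # 1107
print(expected_protein_length(gene_length_bp))  # 368
```

Under these assumptions both values agree with the listed gene length (1107 bp) and protein length (368 aa).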


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.537332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCCAG GGGATATACC ATCGATAACA CAACTTGGAC TGGTCGTCAG CGTACTACTG 
TTCTCGGTGG CCGTTGCCTG CCAATCCAGC AAAAACGTTG ATCCAACACC CGAGCCGCCC
CCAACCGGTG GTGGCATTCA ATACCAGACA ACGCCCGTAA CTCTCCGCAA ACCGGCTAAC
TTTCCGGATG TGGTGTATGA CCTGAGCAAG AACCCATTAA CAGTGGAAGG GGTAGCGCTC
GGCAAAACGT TATTTTATGA CCCGCTCTTA TCGCGCGATA CAACCATTAG CTGCGGGTTC
TGCCATCAGC AGTTCGCGGG TTTTGGCCAT TCCGACCATC CATTGAGTCA CGGAATAGGT
GGAAAATTTG GCACACGTAA TGTACCCGGA CTGGATAATC TGGCGTGGGG GCGCGAGTTC
TTCTGGGATG GGGGCGTAAC CAGTCTGGAC GAATTACCCA TATCGCCCAT CCAGAATCCG
GTGGAGATGG ATCTGAAATT TTCGGAAGCG CTGAACCGAG TGCAGAAAAA CCCCCGGTAC
CCGGCACTGT TCAAAGCCGC TTTTGGCTCC GATACCGTTA CAACGGCCCG TTTTCTCAAA
GCTGTATCGC AGTTTCTTCT GACAATGGTG TCGGCTGATT CGCGCTATGA TAAATACGTT
CGGAAAGAAG CCGGGGGCGA CTTAAACCCG GATGAGCTGG CCGGATTGAC GATATTTCAG
CAGAAATGTG CGACCTGCCA CGCGACCGAT CTGTTTACCG ACCGAAGCTA CCGAAACAAT
GGTCTTCCTG CGGGAGCCAT CAACGACCAG GGCCGGTATA CCATCACGCT GAACGAAGCT
GACCGGTTAA AATTCAGAGT GCCCAGCCTG CGAAATGTGG AGAAGACCTT TCCGTATATG
CATGATGGGC GGTTCGCAAC ACTAGACCAG GTACTCAATC ATTATACCAC AGGCGTCAAA
GACAGCCCCA CCCTCGACCC AGCCCTGAAA GCCAGCGGAC AACTCGGTAT TGCTCTCACC
GACACCGAAA AAAAACAGGT GATCGCCTTC CTGAAAACCT TGACGGATAA TACGTTTATC
AGCAATCGGG CCTTCAGTGC CAACTAA
 
Protein sequence
MRPGDIPSIT QLGLVVSVLL FSVAVACQSS KNVDPTPEPP PTGGGIQYQT TPVTLRKPAN 
FPDVVYDLSK NPLTVEGVAL GKTLFYDPLL SRDTTISCGF CHQQFAGFGH SDHPLSHGIG
GKFGTRNVPG LDNLAWGREF FWDGGVTSLD ELPISPIQNP VEMDLKFSEA LNRVQKNPRY
PALFKAAFGS DTVTTARFLK AVSQFLLTMV SADSRYDKYV RKEAGGDLNP DELAGLTIFQ
QKCATCHATD LFTDRSYRNN GLPAGAINDQ GRYTITLNEA DRLKFRVPSL RNVEKTFPYM
HDGRFATLDQ VLNHYTTGVK DSPTLDPALK ASGQLGIALT DTEKKQVIAF LKTLTDNTFI
SNRAFSAN
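
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce this record: it assumes the sequence is read in frame from its first base, and it builds the codon table by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in which start codons are permitted), so the standard assignments are used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering. Table 11 shares these
# codon-to-amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGAGGCCAG GGGATATACC ATCGATAACA CAACTTGGAC TGGTCGTCAG CGTACTACTG"
print(translate(first_line))  # MRPGDIPSITQLGLVVSVLL
```

The 20 residues printed match the start of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.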