Gene Slin_2149 details


Gene Information

Locus tag: Slin_2149
Symbol:
ID: 8725887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 2605899
End bp: 2607074
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: monooxygenase FAD-binding protein
Protein accession: YP_003386976
Protein GI: 284037046
COG category:
COG ID:
TIGRFAM ID:
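
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; both are assumptions about the record's conventions rather than statements from the source. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Amino acid count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2605899, 2607074)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.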


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.438269
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.289576
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAC GAATCATCAT TTCAGGGGGC GGTATAGCGG GTCTAGCGGC CGCTATTCTT 
TTACAAAAAC AAGGTCACCA GATTATTGTG CTCGACAAAG TGAACGACTT CACCCAGGCC
GGCTTTCTGC TTTCCTTAAA AAGCTTTGGG GTGACCATCA TGGAAGAGCT TGGGCTTGGT
CAGCAGCTGC TGGATGCTTC CGCACCGTCA GAGTATATGA ACTTTGTGGA TTCGGATGGA
CAGCTGATTC GCCGTGTCAG TTACGAGAGG ATGAATCAGC AGATCAACCA GTCGATCCGC
CTGATAACCC GGGGCGGACT TCATCACCTA CTCGTTCATG CCATTCAGGA TAAGGTAACG
ATTCTGTTGG ATACCCGCCT GGAACAAGTA GAACAAATTG GGCAAACCGT CAAGGCTACC
CTGTCCAATG GCCAACTGAT TGAAGCGGAT CTTCTACTGG TTTCGGAAGG ACTACGATCA
ACAACCCGCA ACCGCTATAT AGCCGGCAGC CATGTGGAAG ACTTCAATGT TTTTTACATG
GGTGGTCGAC TAAACGAGCC TCATACCTAT CCTGTAGGTA GTTTCAAAAC GTTTATCGAC
GTCAACAAAA GTTTGGCTAT CTATCCGATC AGTTCTGATG AGCTGGCTAT GCAGTGTTAT
ATCTACAACA CCGATGAGGT AGCCCAACTC CAGGCCAAAA CAGATCAGCT GTTAACGGAG
ACCTTTAAAG GGTATGGTAG CGAGGTGCAA CAATTGATTG ACCGCTTCCT GCACCATGGG
CTGCTGTTTT CGGATAAAAT GGGGATGGCT CATGCCCCCA ATCTGGTCAA CAATCGGATT
GTGCTGGTCG GTGATGCCGG TTACTGTCCT ACTGCCTTGT CGGGCATGGG CGCTTCTTTA
TCACTGTACG GGGCTAAAGC ATTGGCGCAT TTTATTAGCC AATCCCCCGA TGAGATCAGC
TTGGCTTGTC AGCACTACAA TGCGTTGATG CAGCCGATCA TTGAAAAATT TCAGCGCAAT
GCCCGAAGCA ACGCCGAAAC GTTTCTGCCC CAAAATGAAG CTAGTTTGAA CGCGTTCACC
AGGTACTTCA GCACCGCATC CGAAGCCGAT TTATACCAGC GCATGACCGC TCAGCTGGTC
TTGACCGATG ACCAACTTCA TTTTTTCTAC CAATAG
 
Protein sequence
MRKRIIISGG GIAGLAAAIL LQKQGHQIIV LDKVNDFTQA GFLLSLKSFG VTIMEELGLG 
QQLLDASAPS EYMNFVDSDG QLIRRVSYER MNQQINQSIR LITRGGLHHL LVHAIQDKVT
ILLDTRLEQV EQIGQTVKAT LSNGQLIEAD LLLVSEGLRS TTRNRYIAGS HVEDFNVFYM
GGRLNEPHTY PVGSFKTFID VNKSLAIYPI SSDELAMQCY IYNTDEVAQL QAKTDQLLTE
TFKGYGSEVQ QLIDRFLHHG LLFSDKMGMA HAPNLVNNRI VLVGDAGYCP TALSGMGASL
SLYGAKALAH FISQSPDEIS LACQHYNALM QPIIEKFQRN ARSNAETFLP QNEASLNAFT
RYFSTASEAD LYQRMTAQLV LTDDQLHFFY Q
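
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool that produced the record: it uses the standard amino acid assignments, which translation table 11 shares for every codon (the two tables differ only in permitted start codons), and the helper names `translate` and `gc_percent` are hypothetical. Only the first 30 bases and the last 6 bases of the gene sequence are used as input.

```python
# Build the codon table from the NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record's 10-base blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


print(translate("ATGAGAAAAC GAATCATCAT TTCAGGGGGC"))  # MRKRIIISGG
print(translate("CAATAG"))                            # Q
```

The two outputs match the first ten residues and the final residue of the protein sequence above. Applying `gc_percent` to the full gene sequence would give a value to compare against the GC content field.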