Gene Slin_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0049 
Symbol 
ID8723777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp63469 
End bp64644 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content55% 
IMG OID 
Productoxidoreductase domain protein 
Protein accessionYP_003384922 
Protein GI284034992 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0595236 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.22981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTAC CACAAGGCGC GAATCGGCGC AACTTTATTA AAGAAACCAT TGCTACAGCC 
GCTGGTCTGG TTGTATTGCC ATCTTTACCT ATTGAAACCA TTGGCGCACC GGCTATTCTG
CGGACGGGTT CGCTGGGTAG CGACCGGATG CCTACCGGAG CTGCCCGTAT CCGCTTTGCC
GTGATCGGCA TTAACCACGG GCACATCAAT AGCCAGGTGA ATGCGGTGAT CCGGGGCGGG
GGTGAATTTG TTTCGTTCTA CGCCAAAGAG CCCGATTTAG CGGCTGATTT TGCGAAGAAG
TTCCCGCAGG CAAAGCAGGT AAAGTCCGAT AACGAGATTC TGGACGATAA ATCCATTCAG
CTTGTGCTGA CCTCCGGCAT TCCCGAAGAA CGGGCACCAT TGGGTGTTCG CGTCATGAAA
GCCGGGAAGG ATTTTATGAC CGATAAACCC GGTATCACCA CCCTTGAGCA ACTGGCCGAG
GTGCGCAAAG TACAGAAAGA GACGAAGCGG ATTTACTCGA TCATGTACAG CGAACGGCTC
GAAAACAAGG CGACCGTGAA AGCGGGCGAA CTGGTAAAAG CCGGGGCTAT TGGCAACGTG
ATTCAGACCA TAGGACTAGG GCCGCACCGC ATGACGCCCA AAACCCGCCC CGACTGGTTC
TGGGACAAAA AGAAGTTTGG CGGTATCATC TGCGACATTG CCTCGCACCA GTTCGACCAG
TTTCTGTTCT TCACCGGCTC CAAAAAAGCG GATATTGTAG CGTCTCAGGT GGGTAATGTT
CACTTCCCGC AATACCCGAA ATTTGAGGAT TTCGGCGATG TGATGCTCCG GGGCGACGGC
GGCATGGGCT ACATCCGCGT CGACTGGTTC ACCCCCGACG GACTCAAAAG CTGGGGCGAC
GGCCGCCTGA CGATCCTGGG CACCGAGGGC TTTATGGAAC TACGCAAAAA CGTTGACCCT
GCCGGGCGCG AGGGTGGCAA TCACCTGATC ATCACCGACA ATAAAGAGTC TCGTTACATC
GACTGCAGCA ACGTGCCGCT ACCCTACGGC GAGCAACTGG TCAACGATGT AATTGACCGC
ACCGAAACCG CCATGCCGCA GGAGCATTGC TTCCTGGCCA TGGAATTAGC CATAAAAGCG
CAGAAGCAGG CGCAGGTAGT GAATTTTAAG AAGTAG
 
Protein sequence
MSLPQGANRR NFIKETIATA AGLVVLPSLP IETIGAPAIL RTGSLGSDRM PTGAARIRFA 
VIGINHGHIN SQVNAVIRGG GEFVSFYAKE PDLAADFAKK FPQAKQVKSD NEILDDKSIQ
LVLTSGIPEE RAPLGVRVMK AGKDFMTDKP GITTLEQLAE VRKVQKETKR IYSIMYSERL
ENKATVKAGE LVKAGAIGNV IQTIGLGPHR MTPKTRPDWF WDKKKFGGII CDIASHQFDQ
FLFFTGSKKA DIVASQVGNV HFPQYPKFED FGDVMLRGDG GMGYIRVDWF TPDGLKSWGD
GRLTILGTEG FMELRKNVDP AGREGGNHLI ITDNKESRYI DCSNVPLPYG EQLVNDVIDR
TETAMPQEHC FLAMELAIKA QKQAQVVNFK K