Gene Slin_4041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4041 
Symbol 
ID8727799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4857090 
End bp4858523 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content54% 
IMG OID 
ProductRagB/SusD domain protein 
Protein accessionYP_003388830 
Protein GI284038900 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.763305 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGA TAGTGTTTGC GGTGGCCCTG CTGTTAGCCG GATGCCAGAG CGATTTTCTT 
GAAAAGCAAC CCCTCAACGC CGTTACGGCC GATAATTATT ACAAAACGTC GGACGATGCC
ATCCGGGCGG TCAATGCGGC TTATAAACCG CTTATGTACA ATGGCATTGG CCAGTTTGGA
ATTGCCTATT ACGGCAACAA CCAGGCGGGC GATTCCAATA CCTACGGTGA CGATGCCAAC
TGGGTAGCCG TCGAGAACTT TACCGTTACG GCCGATAATC CGGCCATTCG AGCCTCCTGG
ACGTCGTTTT ATCAGGTTAT CTTCCGGGCC AATCTGGTAC TTGACAAGGT TCCGGGCATC
ACGATGGATG CTACGCTAAA AGCCCGGATC CTGGCCGAAG CCTCGTTTTT ACGGGCAATT
TCCTACCACT ATCTGGTGTT GCTTTTCGGC GATGTGCCTT TGATAACGAA ACCCCAGTTC
AACGCCAGCG AGTTTCTGGT AGCACGAACA CCCGTTGAGC AGGTGTACAC GCAGATCATT
GCCGATCTGC AATCGGCGGA GAAGAGCTTA CCGCTCACTT ATCCCGCATC AGATCTGGGG
CGGGCCACCC AGGGTGCGGC CAAGTCGTTT CTGGCAAAAG TGTATCTGTA CCGGAAGCAA
TGGCCCGAAG CCGCAGCCAA AGCCAAAGAA GTTGTCGACT CGAAGGTGTA CAGTCTCTTT
GACCGCTATT ACGATAACTT CGAGCTGGCA ACCGAGAACG GGAAAGAGTC CATTTTTGAA
ATTCAGTATG CGTCATTTCT GGGCGGACTG GGTAACCAGA CCAACAACTA CGATGCTCCG
CGCGGATCGG GCTTCACGCT GGATGGGGGC TACGGCTGGG CGCAGCCAAC CCAGAACTTT
GTGAACTCAT TTGCCGCTAC GGACCCACGA AAAGGCTACA CTATTTTTCA GGCGGGCGAT
GTCTTTCAGG GAATAACATT TAACCCGGCC ACGTCCTCAA CGGGGTATGG TTCGCGCAAG
TACGTTGTGG GGAAAGGGAC AAACATCGGC AAGAGCGACG ACCCCAAAAA CTTTATCCTG
ATGCGCTATG CCGACTTGCT GCTGATGTAT GCCGAAGCCC TGAACGAGTC GGGAAAAACG
GCCGAAGCGC TGGCTCCCAT CAACCAGGTG CGGGCGCGCA AAGATGTGAA TATGCCGCCA
TTGGCCGCTA CGCTCAGCCA AACACAGCTT CGGCAGGCTA TTAAAGATGA GCGCCGGGTG
GAACTGGGTA TGGAAGGTCA CCGCTGGTTC GATCTGGTTC GCTGGGGCGA TGCCGCTGCG
TTTGCTAAAT CCATCGGCAA AACCACGTTC CGCGAGGGCA TCAGCGAGCA TTTTCCCATA
CCCCAGGCCG AGCGCGACAT CAACCCGAAC CTGACGCAAA ATCGGGGTTA TTGA
 
Protein sequence
MKKIVFAVAL LLAGCQSDFL EKQPLNAVTA DNYYKTSDDA IRAVNAAYKP LMYNGIGQFG 
IAYYGNNQAG DSNTYGDDAN WVAVENFTVT ADNPAIRASW TSFYQVIFRA NLVLDKVPGI
TMDATLKARI LAEASFLRAI SYHYLVLLFG DVPLITKPQF NASEFLVART PVEQVYTQII
ADLQSAEKSL PLTYPASDLG RATQGAAKSF LAKVYLYRKQ WPEAAAKAKE VVDSKVYSLF
DRYYDNFELA TENGKESIFE IQYASFLGGL GNQTNNYDAP RGSGFTLDGG YGWAQPTQNF
VNSFAATDPR KGYTIFQAGD VFQGITFNPA TSSTGYGSRK YVVGKGTNIG KSDDPKNFIL
MRYADLLLMY AEALNESGKT AEALAPINQV RARKDVNMPP LAATLSQTQL RQAIKDERRV
ELGMEGHRWF DLVRWGDAAA FAKSIGKTTF REGISEHFPI PQAERDINPN LTQNRGY