Gene Slin_5152 details

Gene Information

Locus tag: Slin_5152
Symbol:
ID: 8728918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6290429
End bp: 6291949
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 51%
IMG OID:
Product: protease Do
Protein accession: YP_003389923
Protein GI: 284039993
COG category:
COG ID:
TIGRFAM ID:
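
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 6290429
END_BP = 6291949
PROTEIN_LENGTH_AA = 506


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1521
print(expected_protein_length(length_bp))  # 506
```

Under these assumptions the span gives 1521 bp, which is 507 codons, or 506 amino acids plus the stop codon, in agreement with the record.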


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000102653
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.175523
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGCA ACTGGAAATT ATTAGCGTTG ATGGCACTCC TGTCGAGTGT AGTAACGCTG 
GCCGCTTACA ACCTGTTGGG ATTCAACAAC CGGGATGTAA TTCTTAACGA AGCGTCGCCC
ATTCAGCAGA TTACGGGCCG TCTGGCATCT ATGCCCGGTG GACCGAGCGC TGCACCCGGC
GACTTCTCTA CGGCTGCCGA AGCCGTAACA CCAATGGTTG TACACATTCG CACAACCATG
ACCCGTACTG TACGTCAGCA ACAGGTTCCC GACATTTTCC GGGAGTTCTT TGGCGATGAA
TTTGGTGGTG GTCAGCGGCA GCCCCGCCGT CAGCAGGGTC AGGCATCTGG CTCGGGCGTA
ATCATCAGCA AAGACGGTTA TATCGTAACC AATAACCACG TGGTACAGGA TGCCGATGAG
GTTGAGGTTA TCATGACCGA CAAACGCAGC TTTAAAGCGA AAGTAATCGG TACCGACCCA
TTGACCGACC TGGCCGTTAT TAAAGTAGAA GCCAACAATC TGCCAGCTAT TACGCTGGGT
GATTCCGACG CTCTGAAATT AGGCGAATGG GTTTTGGCCG TTGGTTACCC ACTTGATCTC
GAATCGACCG TTACGGCCGG TATCGTGAGC GCAAAAGGTC GCCGGATTGG TATCCTCGAC
CAGAACATTA GCAAAACGGA TGCGAAGCCT GATTCGCCGG TTGAAGCTTT CATCCAGACG
GATGCTGCCA TCAACCCTGG TAACTCAGGT GGTGCGCTGG TTAACCTGCG TGGCGAATTA
GTCGGTATTA ACTCGGCTAT TGCTTCGGCA ACGGGCTATT ACAGCGGTTA TGGCTTTGCG
GTACCTGTAT CGCTCGTGAA GAAAGTATCT GCCGACCTGC TCAAATATGG TAACGTACAA
CGCGGTTATA TCGGCATTCT GCCAATTGAA CTGAACAGCA CGGTAGCTAA AGAGAAAGGT
GCGAAAATTG GTCGTGGCAT CTACGTCGAG AGCGTTGTTG AAAAAGGTGC AGCTGAAGCC
GCTGGTCTGA AAAAGGGTGA CGTCATCGTG AAAATGGAAG GCCAGCCGCT TGATTCAGAT
GCGCAAATGC GTGAAATCAT CGGTCGTCGT CGTCCGGGCG ATGTGGTTAA TGTAACGGTT
AACCGGGATG GTACCGAGCG TGACTTTAAA GTCGAACTTC GTAACCGTAA TGGTGGCCGG
GATGTGATCA AGAAATCGGA CATTACCGCA GCCAATACCT CATTAAGTAC GCTGGGTGCC
AGCTTTGAAG AGCTATCAGC TCAGGAAGCA AAACAGCTTG GTGTTACCGG CGGGGTTCGG
GTCAAAAAAA TTACTGATGG TAAGCTGGCC GAAACTGATA TTGAGGAAGG CTTCATTATC
GTAAAGGCAA ACGGTAAGAA CGTCAAAACG ACGAAAGACC TGCAAGCCAT CATGTCGACC
GTTAAAGAAG GCGAAGGCCT GATGCTGATC GGCATGTATC CCAACAGCTC ACGGATGTAC
TACTACGCCG TTCCGGTGTA A
 
Protein sequence
MKSNWKLLAL MALLSSVVTL AAYNLLGFNN RDVILNEASP IQQITGRLAS MPGGPSAAPG 
DFSTAAEAVT PMVVHIRTTM TRTVRQQQVP DIFREFFGDE FGGGQRQPRR QQGQASGSGV
IISKDGYIVT NNHVVQDADE VEVIMTDKRS FKAKVIGTDP LTDLAVIKVE ANNLPAITLG
DSDALKLGEW VLAVGYPLDL ESTVTAGIVS AKGRRIGILD QNISKTDAKP DSPVEAFIQT
DAAINPGNSG GALVNLRGEL VGINSAIASA TGYYSGYGFA VPVSLVKKVS ADLLKYGNVQ
RGYIGILPIE LNSTVAKEKG AKIGRGIYVE SVVEKGAAEA AGLKKGDVIV KMEGQPLDSD
AQMREIIGRR RPGDVVNVTV NRDGTERDFK VELRNRNGGR DVIKKSDITA ANTSLSTLGA
SFEELSAQEA KQLGVTGGVR VKKITDGKLA ETDIEEGFII VKANGKNVKT TKDLQAIMST
VKEGEGLMLI GMYPNSSRMY YYAVPV
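
The relationship between the gene sequence and the protein sequence can be checked by translating the DNA with translation table 11. The sketch below is one way to do that in plain Python. It assumes that table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), that the sequence begins in frame, and that translation stops at the first stop codon. Alternative start codons are not handled, which does not matter here because the gene begins with ATG. The helper names `clean`, `translate`, and `gc_percent` are illustrative.

```python
# Standard codon-to-amino-acid assignments, with bases ordered T, C, A, G.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence shown above.
print(translate("ATGAAAAGCA ACTGGAAATT ATTAGCGTTG"))  # MKSNWKLLAL
print(translate("TACTACGCCG TTCCGGTGTA A"))           # YYAVPV
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) gives a quick consistency check, and `gc_percent` on the same input can be compared against the GC content field.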