Gene Slin_4029 details

Gene Information

Locus tag: Slin_4029
Symbol:
ID: 8727787
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4837876
End bp: 4839303
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003388818
Protein GI: 284038888
COG category:
COG ID:
TIGRFAM ID:
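
The start, end, gene length, and protein length in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends with a single stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check CDS coordinates against the reported lengths.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and exactly one terminal stop codon is left untranslated.
    """
    span_bp = end_bp - start_bp + 1        # inclusive coordinate span
    codons = span_bp // 3                  # whole codons in the span
    return {
        "span_bp": span_bp,
        "span_matches_gene_length": span_bp == gene_length_bp,
        "length_is_multiple_of_3": span_bp % 3 == 0,
        "expected_protein_aa": codons - 1,  # drop the stop codon
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for Slin_4029.
result = check_cds_lengths(
    start_bp=4837876,
    end_bp=4839303,
    gene_length_bp=1428,
    protein_length_aa=475,
)
print(result)
```

Under these assumptions the span is 1428 bp, which is 476 codons, or 475 amino acids plus the stop codon, in agreement with the reported values.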


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.500538
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.302353
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGTC GTACGTTTAT CCAACAATCT GCCGGGGCAA GTGCTGGCCT GATGGCTGGT 
CAGTCACTTT GGGCCGCACC AGCCGCCGAC TTTCCCGTTG TTCGGACAGC AGCCGCTCAG
CGCAATTTCA CCAGCTCGGC TGTAGAGCAG ACCATCCAGC GAATGCATAA AACTATTCGC
GACCCGGAAC TGGCCTGGCT TTTCGAGAAC TGCTTTCCCA ATACCCTCGA TACCACCGTG
CAGGTGAGTA CCAGCAATGG CCAACCCGAT ACGTTTGTCA TTACCGGCGA CATCGACGCT
ATGTGGCTGC GCGACAGTAC GGCGCAGGTA TGGCCCTATT TGCCGCTCAT TAAGCAGGAT
AAACCCTTGC AGCAACTGAT TGCGGGGGTT ATCCGTCGGC AATCCCTGTG CATTCGGCGC
GACCCCTACG CCAACGCCTT TTATGCCGAT GCCAGCAAAG AGGGCGAATG GAAGAAAGAC
GTGACAGCCA TGAAACCCGG TCTGCACGAA CGAAAGTGGG AACTCGATTC GCTTTGCTAC
GCCATCCGGC TGGGTTATCA CTACTGGAAA ACGACCGGCG ATACCAGCCC ATTCGATGCC
GACTGGCTAC AGGCCATGCA ACTCGTTTTG CAGACCTGCC GGGAGCAGCA ACGTAAAACC
AGCAGAGGGC CTTATAAATT CAGCCGCGAA ACGTCCTGGT CGACAGATAC CGTGCCGGGC
GATGGCTACG GGAATCCAAC GCGTCCGATT GGGTTGATAA ACAGCATCTT CCGCCCGTCT
GACGATGCAA CGGTTTTTCC GTTTTATGTA CCCTCGAACT GGTTTGCAGT GGTGTCGCTT
CGGCAACTGG CTACGATGGT CGATCAGATT CGGCCCACGC CTGCACTGGC TGCCGGTTGC
CGGGCTTTGG CCGATGAAGT GGAACGGGCG CTGAAACAGT ACGCCATTTA TACGCACCCC
AAATACGGGA AGATGTACGC CATGGAAGTA GATGGGTACG GAAATCACCT GCTTCAGGAC
GACGCCAACG TGCCCAACTT ACTGGCTTTG CCGTATCTGG GTGCCATGCC CGCCAGCGAT
CCGATCTATA AAAATACCCG TCGCTTTGTG CTCAGTCCGG ACAACCCGTA TTTCTTCAAA
GGAAAAGCGG CCGAGGGTGT CGGCAGTCCG CACACGCTGG TCAACAACAT CTGGACTATG
AGCCTGACCA TGCGCGCACT CACTTCCACC GACGATCAGG AGATTCTGGC GCAGCTTCGG
CTGCTGAAGA AAACCCATGC AGGCACGGGC TTCATGCACG AATCTTTCAA CCAGGACGAC
CCCGCTAAAT TCACCCGAAA GTGGTTTGCC TGGGCCAATA CCCTCTTCGG CGAACTGATC
CTGAAAGTGG CTAACGAACG TCCGCAGCTG TTGGATAAAG TCCTGTGA
 
Protein sequence
MNRRTFIQQS AGASAGLMAG QSLWAAPAAD FPVVRTAAAQ RNFTSSAVEQ TIQRMHKTIR 
DPELAWLFEN CFPNTLDTTV QVSTSNGQPD TFVITGDIDA MWLRDSTAQV WPYLPLIKQD
KPLQQLIAGV IRRQSLCIRR DPYANAFYAD ASKEGEWKKD VTAMKPGLHE RKWELDSLCY
AIRLGYHYWK TTGDTSPFDA DWLQAMQLVL QTCREQQRKT SRGPYKFSRE TSWSTDTVPG
DGYGNPTRPI GLINSIFRPS DDATVFPFYV PSNWFAVVSL RQLATMVDQI RPTPALAAGC
RALADEVERA LKQYAIYTHP KYGKMYAMEV DGYGNHLLQD DANVPNLLAL PYLGAMPASD
PIYKNTRRFV LSPDNPYFFK GKAAEGVGSP HTLVNNIWTM SLTMRALTST DDQEILAQLR
LLKKTHAGTG FMHESFNQDD PAKFTRKWFA WANTLFGELI LKVANERPQL LDKVL