Gene Slin_3381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3381 
Symbol 
ID8727134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4085582 
End bp4086949 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content55% 
IMG OID 
Productamidohydrolase 
Protein accessionYP_003388188 
Protein GI284038258 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.936479 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.571146 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACC TGCTTCTATT CATCATCCTG CTTCTCGCTT CGTGTGCTGC GCTTTCGGCC 
CAGGCCCTAA TCAGCAATGG CCAGCGCGAC ATTGTATTCA AATCCGTCAA TGTGATCCCC
ATGGACCGCG AGCGCGTGCT CGAAAATCAG ACGGTTGTGG TGCGGAACGG CCGGATAGCC
GCCCTGGGTA CGGCTGGAAA AGTAGCTGTC AGCAAAGATG CGCTGGTGAT TGACGCCAAA
GGGAAATACC TGACGCCGGG CTGGGCTGAA ATTCACGCCC ACGTGCCGCC CATCGACGAT
ATCGAGCCCA TGAAAGAGGT ACTGATGCTG TATCTGGCCA ACGGCATTAC GACCATTCGG
GGTATGCTGG GCCACCCCCG CCATCTGGAA CTTCGCAGTA AGATCAACAG CGGTGAAATT
CTGGGGCCAC ATTTTTACGC CACCGGCCCA TCATTCAACG GGCAAACCGT GAAGACCGCC
GAGCGGGGTG CCCAGATGGT TCGCGAGCAG AAAGCGGCTG GTTATGATTT TCTCAAACTG
CATCCGGGAC TCACCAAAGA GACGTTCCCG GCCATTGCCA AAACGGCGCA CGAAGTCGGT
ATTCCTTTTG TAGGCCACGT GTCGTTCAAT GTGGGTGTCT GGCGGGCAAT TGATGCGGAG
TACTCGTCCA TCGACCATAT GGACGGATTT GTGGAGGCCA TCGTTCCCCG TTCGGATACG
CTGGCCGAAC CCGAAACCGG CCTGTTTGCG TCCTGGATCG CCTACCGGGC CGATGCCTCG
CAGATTCCCA AACTTGTAAA GGGTCTGCGC GATAAGCATG TCCGAGTCGT GCCCACGCAG
GCTCTGGCCG AACGCTGGCT CTCGCCCTTA CCCGCCGATG CGTTTACGAA CGACCCCGAA
ATGAAGTATA TGAAACCGGA GCAGGTTACA AGCTGGGAGA ATACCAAAAA AAGCTACCTC
GCCAATCCAA ACTTCTCGAA AGAACATGCC GAAAAACTGA TTCAGATTCG CCGGAAGCTC
ATCTATGAAT GCCAGAAAAA CGGCGTCGAT ATTCTGTTAG GCTCCGATGC GCCCCAGATT
TTCAATGTGC CCGGCTTCTC CATCCACCAC GAAATGAAAT ACATGGTCGA CGCCGGACTG
ACGCCCTACG AAACCCTGCG GACGGGCACG GTCAACGTGG CATCTTACCT GAACAAACCC
GATTGGGGCG TCGTGAAGAC GGGAAATGTA TCGGACCTGG TGTTACTCAG TGGAAACCCG
CTGAAAGACA TCAGCCAGAC CAAAAACATT GAGGGCGTAA TGATGGGTAC GAACTGGCTG
TCGAAGGCGT ATATCCAGAA CGAGTTGAAA AAGCTGGAGA AACAGTGA
 
Protein sequence
MKHLLLFIIL LLASCAALSA QALISNGQRD IVFKSVNVIP MDRERVLENQ TVVVRNGRIA 
ALGTAGKVAV SKDALVIDAK GKYLTPGWAE IHAHVPPIDD IEPMKEVLML YLANGITTIR
GMLGHPRHLE LRSKINSGEI LGPHFYATGP SFNGQTVKTA ERGAQMVREQ KAAGYDFLKL
HPGLTKETFP AIAKTAHEVG IPFVGHVSFN VGVWRAIDAE YSSIDHMDGF VEAIVPRSDT
LAEPETGLFA SWIAYRADAS QIPKLVKGLR DKHVRVVPTQ ALAERWLSPL PADAFTNDPE
MKYMKPEQVT SWENTKKSYL ANPNFSKEHA EKLIQIRRKL IYECQKNGVD ILLGSDAPQI
FNVPGFSIHH EMKYMVDAGL TPYETLRTGT VNVASYLNKP DWGVVKTGNV SDLVLLSGNP
LKDISQTKNI EGVMMGTNWL SKAYIQNELK KLEKQ