Gene Slin_5037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5037 
Symbol 
ID8728802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6141447 
End bp6142745 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content56% 
IMG OID 
Productamidohydrolase 
Protein accessionYP_003389813 
Protein GI284039883 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCGA TTAAATGGCT ATTGATGTTC TTGCTTGTAC CCGACAGCAT CTTGCTGTCG 
GTGGGACAAA CAAATTATAA TGTATCTCGC CGACAGCAAG ATGCTGTCGG GTACCTCCTC
CGTCCCGACC GGGTTTTCGA TGGCGAAACC ATGCATGAAG GCTGGGTCGT ACGGGTGAAA
GACGATAAGA TTGAGGCCGC TGGTCCGGCC AACAGTGTTT CGTCAACGGG GGCTACCGTG
GTTGATCTGA AAGGAACAAC ACTCATGCCC GGTTTGATCG AAGGACATTC GCATTTGCTT
TTGCATCCGT ACAATGAAAC GCCCTGGGAT GATCAGGTGC TGAAGGAAGC CCGCTCGCTG
CGCGTAGCGC GGGCCACCGT TCACGCCCGG AAAACACTGG AAGCGGGCTT TACTACCGTG
CGTGATCTAG GGACCGAAGG GGCCGATTAT GACGATGTCG GCTTGAAGCA GGCGATTAAT
AAAGGCATCA TACCCGGCCC GCGCATGATG ATCGTGACGC GGGCGCTCAT TGCTACCGGT
AGTTACGCGC CAAAAGGCTT CAGCCCGGAT ATCGACGTGC CGCAGGGAGC CGAAGAAGCG
GATGGCCACG ATGCGCTGAT TCAGGCCGTT CGGCGGCAGA TTGGCAAAGG AGCCGACGCC
ATAAAGATTT ACGCCGATTA CCGCTGGGGA CTCATGGCCG AAGCCCGCCC AACCTATACC
CTCGATGAAA TTAAGCTTAT CGTCGAAACC GCCCGAAGCA GCGGGCGGGG TGTGGTGGCT
CATGCCAGTA CGGCAGAGGG GATGCGCCGG GCTATTCTGG GCGGCTGCGA AACCGTTGAA
CACGGTGATG CCGGAACCCC CGAAATCTTT GCGCTTATGA AACAACATGG TACCGCCCTA
TGTCCAACAC TGGCCGCTGG CGATGCGATT AGTCAATACC GGGGCTGGAA AAAAGGGCAG
GATCCCGAAC CCGAGCGGCT CAAACAAAAG CGGGATATAT TCAAACAGGC ACTGGCTGCA
GGGGTTACCA TTTGCGCCGG GGGCGATGTG GGTGTATTCA GTCACGGCGA CAACGCTCGT
GAATTGCTGC TGATGGTCGA TTACGGAATG AAGCCAATCG ACGTGATGCG TTCGGTCACG
TCTATTAATG CCGACGTTTT CAAATTAACG GATCGGGGAC GCATTCGGCC GGGTTTGCTG
GCCGATTTAG TCGCCGTACA GGGCGATCCA ACAAAAACGA TCACCGATGT GCAACGCGTT
CAGATGGTTA TGAAAGGGGG CGTTTTTTCG AAGCGTTGA
 
Protein sequence
MISIKWLLMF LLVPDSILLS VGQTNYNVSR RQQDAVGYLL RPDRVFDGET MHEGWVVRVK 
DDKIEAAGPA NSVSSTGATV VDLKGTTLMP GLIEGHSHLL LHPYNETPWD DQVLKEARSL
RVARATVHAR KTLEAGFTTV RDLGTEGADY DDVGLKQAIN KGIIPGPRMM IVTRALIATG
SYAPKGFSPD IDVPQGAEEA DGHDALIQAV RRQIGKGADA IKIYADYRWG LMAEARPTYT
LDEIKLIVET ARSSGRGVVA HASTAEGMRR AILGGCETVE HGDAGTPEIF ALMKQHGTAL
CPTLAAGDAI SQYRGWKKGQ DPEPERLKQK RDIFKQALAA GVTICAGGDV GVFSHGDNAR
ELLLMVDYGM KPIDVMRSVT SINADVFKLT DRGRIRPGLL ADLVAVQGDP TKTITDVQRV
QMVMKGGVFS KR