Gene Slin_3958 details


Gene Information

Locus tag: Slin_3958
Symbol:
ID: 8727716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4749880
End bp: 4750968
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 59%
IMG OID:
Product: peptidase M19 renal dipeptidase
Protein accession: YP_003388747
Protein GI: 284038817
COG category:
COG ID:
TIGRFAM ID:
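
The coordinates and lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that does not appear in the protein. The function name `check_cds_lengths` is hypothetical and only used for illustration.

```python
def check_cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it is not counted in the protein.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record for Slin_3958.
gene_len, prot_len = check_cds_lengths(4749880, 4750968)
print(gene_len, prot_len)  # 1089 362
```

Both results agree with the Gene Length and Protein Length fields of the record.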


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.561135
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCATTA TTGATGCCCA CCTCGACATG GCCCTCAACG CCATCGAATG GAATCGCGAT 
TACCGCCTGT CAGCCCACCA GATCCGCGAA CTGGAAGCCG ATATGACCGA CAAAATTGAC
CGGGCAAAAG GCACCGTCTC TCTTCCCGAC CTCCGCCGGG GTAATATCGG TCTGGTCGTG
GCGACGCAGA TTGCCCGGTT CAACCAAAGC AACGGAAACC TGCCCGGCGC GGGCTGGAAC
TCCCCTGAAC AGGCCTGGGC CATGACGCAG GCACAGCGGA CGTGGTACGA AACGATGGTC
GACGCGGGCG AAATGGTGCA GATCACCGAC CGGACCAGCC TCGATAGCCA CGTGGCGCTC
TGGCTCGACG AAAGTATTCC CAACGACACC AAACCCGTCG GGTATATCCT CAGTCTGGAG
GGGGCCGACT CGCTGGTGAA CCTGTCGTAC CTGGAGAAAG CGTATAATTA CGGCTTACGC
GCCCTCGGTC CGGCGCACTA CAGCACGGGC CGTTATGCCC CCGGCACCGG CCTGAATGGT
CCGCTGACGG CGCAGGGCCG CGAGCTAGTG AAAGAAATGG ACCGGCTGGG CATTATTTTA
GATGCAACCC ACCTCACCGA CGAAGGATTT ACGGAAGCCC TGTCTTTGTA CAAGGGACCC
GTATGGGCGA GTCACCACAA TTGTCGGGCG CTGGTGCCGC ACCAACGGCA GCTCACCGAC
GATCAGATCA GGCAGTTGAC GGATCGGGGC GGGGTTATCG GCGGGTGTTT CGATGCCTGG
ATGATGAAGC CCGGTTTCAC CCAGCGCGTC AGCAATCCGA CCGAATTTGG CATTAGTATC
GAAACAATCA TCGACCACTA CGACCACATT TGCCAGCTCA CCGGCAGCAG CCAGCACATC
GCCATCGGCA GTGATCTCGA CGGCACCTAC GGCATCGAAC AATCGCCCAG TGACCTCGAC
ACCATCGCCG ACCTGCAAAG CCTGACCGGT TTACTAACGA AACGCGGCTA CACCCAGGAG
GATATTGAAA ATATTTTCCA CAAAAACTGG CTGCGGTTTC TGCGAGGGGC GTGGTCCCCA
GGCACCTAA
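
The GC content reported for this gene can be recomputed from a sequence printed in blocks of ten bases like the one above. This is a minimal sketch, assuming whitespace is the only formatting to strip and that only G and C are counted. The function name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) is removed before counting.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input; paste the full gene sequence here to check the record's value.
print(gc_percent("AT GC"))  # 50.0
```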
 
Protein sequence
MFIIDAHLDM ALNAIEWNRD YRLSAHQIRE LEADMTDKID RAKGTVSLPD LRRGNIGLVV 
ATQIARFNQS NGNLPGAGWN SPEQAWAMTQ AQRTWYETMV DAGEMVQITD RTSLDSHVAL
WLDESIPNDT KPVGYILSLE GADSLVNLSY LEKAYNYGLR ALGPAHYSTG RYAPGTGLNG
PLTAQGRELV KEMDRLGIIL DATHLTDEGF TEALSLYKGP VWASHHNCRA LVPHQRQLTD
DQIRQLTDRG GVIGGCFDAW MMKPGFTQRV SNPTEFGISI ETIIDHYDHI CQLTGSSQHI
AIGSDLDGTY GIEQSPSDLD TIADLQSLTG LLTKRGYTQE DIENIFHKNW LRFLRGAWSP
GT
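
The protein sequence can be reproduced from the gene sequence with the genetic code named in the record (translation table 11). The sketch below builds the codon table from the NCBI base ordering (T, C, A, G), whose amino acid assignments for table 11 are the same as in the standard code. It assumes the sequence starts in frame at its first base, and it does not handle the alternative start codons of table 11, which is sufficient here because the gene starts with ATG. The name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence of Slin_3958.
print(translate("ATGTTCATTA TTGATGCCCA CCTCGACATG"))  # MFIIDAHLDM
```

The output matches the first ten residues of the protein sequence, and the final codons GGC ACC TAA of the gene give the terminal residues GT followed by a stop.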