Gene Slin_2012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2012 
Symbol 
ID8725750 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2426680 
End bp2428035 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content52% 
IMG OID 
Productglutamate-1-semialdehyde-2,1-aminomutase 
Protein accessionYP_003386856 
Protein GI284036926 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.919837 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAAC GAACGAACCT ATCCGGAGGG ATTAATTCGT TCTTTTTATC ATTTGCCTTT 
CAAAAAATGA CAACGAGCGA ACAATTATTT GAAAAAGCGA AAACGCTGAT TCCCGGCGGT
GTAAACTCAC CCGTGCGGGC GTTTCGGTCC GTGGGTGGAT CGCCCATTTT TATCAAATCG
GCAAAGGGTC CCTACATTGT GGATGAGGAT GGCCGGCAAT ACATTGAATT GATCAATTCG
TGGGGACCCA TGATTCTGGG TCACGCTTTT GAGCCGGTCG AGAAAGCCGT CCGTGACGCA
ATTCAGCATT CGTTTTCGTT CGGAGCGCCC ACGCGCAAAG AAGTTGAGAT GGCTGAGTTG
ATTACCGCTA TGGTGCCCTC GGTCGAAAAG GTCAGGATGG TGAACTCCGG TACCGAAGCC
ACCATGGCGG CTATCCGGGT GGCGCGTGGC TTTACCGGCC GCGACAAGCT GATTAAGTTT
GAGGGGTGTT ACCACGGGCA CGGCGATTCG TTTTTGATCG CTGCCGGAAG TGGTGCCATG
ACTATGGGTA TTCCCGATAG TCCGGGCGTT ACTAAGGCAA CGGCTGCGGA TACGCTGACC
GCGCCGTACA ATGATCTGGC AGCTGTTGAA ACGTTGCTGG ATAATAACCA CAATCAGGTA
GCCGCCATCA TTCTGGAGCC GGTGGTCGGG AATATGGGGT GCGTACTGCC TGAACCGGGC
TTTCTGGAAG GCATCCGCTC GCTTTGCGAT AAGCATGGTG TTGTCCTGAT TTTCGACGAA
GTAATGACGG GCTTCCGGCT TGCCAAAGGA GGGGCTCAGG AACGCTTTGG CATTACCCCC
GACCTCACCA CCATGGGTAA AATTATTGGT GGAGGCATGC CCGTTGGTGC CTATGGCGGA
CGGGCCGACA TTATGGAAAT GGTGGCCCCG GCCGGTCCGG TCTACCAGGC CGGCACTTTA
TCGGGAAATC CGATTGCCAT GTCGGCAGGA CTGGCCATGC TTCATCACCT AAATGATCAC
CCGGAGGTAT ACACCCGACT GGAGACGATT GGTAAAAAAC TAACCGATGG GTTCCGGGAA
GGGTTGCAGA AGGCTGGCCT GTCGTATACC ATTAATCACA TTGGCTCCAT GTTTACCCTG
TTTATGACAA ACAGTCCTGT AAGCAACTTC ACGGAAGCCA AAACGTGCGA TACTCCGCTT
TTTGGCCGCT ATTTCCACGC TATGCTGGAA CGGGGTGTTT ACCTGGCTCC ATCCCAGTTT
GAGAGTCTGT TTCTATCCGT TGCTCTTACC GATGAACTCG TGGATCAGGT TATCCAGGCC
AACGAAGAAA GCTTACTGGA AATAATGAAT AAGTAA
 
Protein sequence
MSKRTNLSGG INSFFLSFAF QKMTTSEQLF EKAKTLIPGG VNSPVRAFRS VGGSPIFIKS 
AKGPYIVDED GRQYIELINS WGPMILGHAF EPVEKAVRDA IQHSFSFGAP TRKEVEMAEL
ITAMVPSVEK VRMVNSGTEA TMAAIRVARG FTGRDKLIKF EGCYHGHGDS FLIAAGSGAM
TMGIPDSPGV TKATAADTLT APYNDLAAVE TLLDNNHNQV AAIILEPVVG NMGCVLPEPG
FLEGIRSLCD KHGVVLIFDE VMTGFRLAKG GAQERFGITP DLTTMGKIIG GGMPVGAYGG
RADIMEMVAP AGPVYQAGTL SGNPIAMSAG LAMLHHLNDH PEVYTRLETI GKKLTDGFRE
GLQKAGLSYT INHIGSMFTL FMTNSPVSNF TEAKTCDTPL FGRYFHAMLE RGVYLAPSQF
ESLFLSVALT DELVDQVIQA NEESLLEIMN K