Gene Slin_2043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2043 
Symbol 
ID8725781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2472073 
End bp2473989 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content56% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003386887 
Protein GI284036957 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0081773 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.454673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTAC CCCTTACCTA TTCCGACTGG CACCAGTACA GCCTGACTCA TCCGGAAGCG 
TTCTGGGCAG AGCAAGCCAG GAATATTCCC TGGTTTACCC CGCCCACGCA AATTCTATCG
ACCGATGAAA ACGGCCTGAC GCGCTGGTTT GCGGATGGGC AGCTCAATAC CTGTTATGCC
GCTCTGGATT ACCACGTCGA GAATGGCCGG GCCGACCAGG TGGCCCTCAT TCATGATTCG
CCCGTTACAA ACACCATCCG GCAGTTTACT TACCGAGAAC TGCGCGATGA AGTGGCCCGG
CTGGCCGGCG TTTTGCAGAA ACTGGGCGTT ACAAAAGGCG ATACGGTCGT GATTTACATG
CCGATGATTC CCGAAACAGC CTTTGCCATG CTGGCCTGTG CCCGGCTGGG GGCTATTCAT
TCGGTCGTGT TCGGTGGTTT CGCACCCCAC GAACTGGCGC TGCGTATCGA CGATGCCCGC
CCCAAAGTTG TGCTGTCGGC ATCCTGTGGC ATTGAGTTTA CGACGATTAT TCCCTACAAG
CCCTTGTTGA ATGATGCCCT CCGGCAGGCC AGCTTCCAAC CCGAGCATTG TCTGATTTTG
CAGCGCCCGC AATGTGCCGC CGAGCTGCAA ATCGGCCGGG ATCACGACTG GCAGTTGCTG
GTCAGTCGGG CGCAACCCGC CGACTGTGTG CCCGTCAACG CGACCGACCC GCTCTATATT
TTGTATACGT CCGGTACTAC CGGCAAGCCA AAAGGGATCG TGCGCGACAA TGGCGGCCAT
GCCGTCGCTA TTAAATACAG CATGCAGGCC GTGTATAATC TACAGCCCGG ACAGGTGATG
TTTACCGCGT CTGATTTTGG CTGGGCGGTT GGGCATAGCT ATTCGGTTTA TGGGCCGCTG
CTTCAGGGTT GTACATCGGT TATTCTGGAA GGAAAACCCG TTCGAACGCC CGATGCCGGT
ACGTTCTGGC GGGTAGTACA GACCTACGGG GTCAATGTGT TGTTCACCGC GCCCACCGCT
TTTCGGGCCA TTAAAAAAGA AGATCCTGAG GCCGTACTCA GCCGTAACTA CGACCTGTCA
TCACTGCAAA GTGTATTCGT GGCCGGGGAG CGGTGCGACC CGCCAACCCT TCAATGGCTG
CAAGCTATCG TTCACGTGCC CGTCATCGAC CACTGGTGGC AAACCGAATC GGGCTGGCCT
ATGGTGGCAG ATCCGCTGGG AATCGAGGAG CTGCCGGTGA AACCCGGCTC GGCTACAAAG
CCCGTTTGTG GCTATGATTT ACAAATTCTG GATGAAGACG GGTACCAGCT AGGCCCAAAT
GAAATGGGAT TGGTTTGCCT GAAACTACCC CTTCCGCCCG GTTGCCTGCC ATCGCTCTGG
CAGGACGATG TGCGGTTCCG GGCTTCCTAC CTGAGCCGCT TCCCCGGTTA CTACCTGTCG
GGCGATGGCG GTTATGTTGA CGAAGATGGC TACGTCTTTA TCATGGGACG CGTCGACGAC
GTGATAAACG TAGCCGGACA CCGCTTGTCG ACCGGTGAAA TGGAAGAAAT AGTGAGCAGT
CATCCTGCGG TTGCCGAATG CGCCGTGGTA GGCATTGCCT GTCCGTTGCG GGGACAGCGC
CCCGTCGGTT TTATCGTTCT GAAAGACGGG TTTCAAATCC AGGAGACAAC CCTTGAAACG
GAATTGGTAA CGCTCATCCG CGATCAGATC GGAGCGGTCG CCTGTTTTCG AAATGCGCTG
ACGGTGAAAC GCTTACCCAA GACCCGGTCC GGTAAGATTC TCCGGAAGAT CATCCGGTAC
ATAGCGGATG GCGAAGCCTA CACAACCCCG GCTACTATTG ATGATCCACT GATTCTGAAC
GAGATAAAAG ACGCATTGCT CAGGCGCAGA ATCGGTCAGC CGTTCGAAAT TGATTGA
 
Protein sequence
MNLPLTYSDW HQYSLTHPEA FWAEQARNIP WFTPPTQILS TDENGLTRWF ADGQLNTCYA 
ALDYHVENGR ADQVALIHDS PVTNTIRQFT YRELRDEVAR LAGVLQKLGV TKGDTVVIYM
PMIPETAFAM LACARLGAIH SVVFGGFAPH ELALRIDDAR PKVVLSASCG IEFTTIIPYK
PLLNDALRQA SFQPEHCLIL QRPQCAAELQ IGRDHDWQLL VSRAQPADCV PVNATDPLYI
LYTSGTTGKP KGIVRDNGGH AVAIKYSMQA VYNLQPGQVM FTASDFGWAV GHSYSVYGPL
LQGCTSVILE GKPVRTPDAG TFWRVVQTYG VNVLFTAPTA FRAIKKEDPE AVLSRNYDLS
SLQSVFVAGE RCDPPTLQWL QAIVHVPVID HWWQTESGWP MVADPLGIEE LPVKPGSATK
PVCGYDLQIL DEDGYQLGPN EMGLVCLKLP LPPGCLPSLW QDDVRFRASY LSRFPGYYLS
GDGGYVDEDG YVFIMGRVDD VINVAGHRLS TGEMEEIVSS HPAVAECAVV GIACPLRGQR
PVGFIVLKDG FQIQETTLET ELVTLIRDQI GAVACFRNAL TVKRLPKTRS GKILRKIIRY
IADGEAYTTP ATIDDPLILN EIKDALLRRR IGQPFEID