Gene Slin_5083 details

Gene Information

Locus tag: Slin_5083
Symbol:
ID: 8728848
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6216310
End bp: 6218715
Gene Length: 2406 bp
Protein Length: 801 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: putative aminopeptidase
Protein accession: YP_003389857
Protein GI: 284039927
COG category:
COG ID:
TIGRFAM ID:
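
The length fields in this table can be cross-checked against the coordinates. The sketch below is a minimal illustration, not part of the original record. It assumes the start and end positions are 1-based and inclusive (the page does not state the convention) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 6216310
END_BP = 6218715


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


gene_len = gene_length(START_BP, END_BP)
print(gene_len, protein_length(gene_len))
```

Under these assumptions the computed values agree with the listed Gene Length (2406 bp) and Protein Length (801 aa).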


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAC TCCTGTTAAT GGCGAGCCTG TCCCTCTGGT CGGCCGCCCT GATGGCCCAG 
GCACCCGCAG CGCCGACGCA GAACGCCAAC TCGCGTTTTG AGCAGCTTGG CCCCATGCTC
CCCACGCCCA ATACCTTCCG GACGGCCTCC GGTGCACCGG GAAAAGACTA CTTCCAGAAC
CGGGCTGATT ATGATATCAA GGTAGAACTT GACGACGCCA ACCAGAAGAT CATCGGTACC
GAAACGGTTA CGTATCATAA CAATTCCAGC GATGAACTTC CGTTTATCTG GCTTCAACTG
GATCAGAACC TGTTCGCCAA AGGCTCTACC GGCAGTGTAA CCCGCACGGG TGCTGTTAAT
GAAAGCGGGA TGAGCTTTGC ACAGTTGCAG AACCTCACCT CCGTTCGTGA GCGCAGCAGC
CAGCAGGCTT CGGATAAATT TGGTTATCAT ATCACCGCCG TGAAAGATGC CAAAACAGGG
AAAGCGCTTC AATACACAAT CAACCAGACT ATGATGCGAA TTGACTTGCC AGCGGCCATT
AAGCCCGGCG GCAGTTATTC GTTCAACGTA GACTGGAATT ACTTCGTAAC GGAGTATTAC
GGCCGGAGCG GGATGGAATA CTTTGCGAAA GATGGCAACT ACAACTACTT CATTGCCCAC
TGGTTCCCTC GTTTGTGCGC CTATAGCGAC GTGACGGGCT GGCAAAACAA GCAGTTTCTG
GGACAGGGCG AGTTTACCCT CATCTTCGGA AACTACAAAG TTGCCATTAC TGCTCCTAGC
GACCACGTAG TGGGTGCCAC CGGTGAATGT CAGAATTACA AGCAGGTGTT GACGGCTACG
CAGCAAAAGC GCATGGCGCA GGCCGCTACG TCCAAAACGC CCGTCGTCAT CGTTACGCAG
GATGAAGCCG AAGCAGCCCT GAAAACGAAG CCAACGGATA AAGTAGGCAA GAAAACATGG
GTATATGCCG CTCAGAACGT GCGCGACTTC GCGTTTACCA GCAGTCGTCG TTTTATCTGG
GACGCCATGC AAACCGATGT GTACGGCGAT GGCCATAAAA TCTGGTCGAT GTCGTTCTAC
GCTAAAGAAG GCAACCCGTT GTGGGGCCAG TATTCGACCC GCGTCGTTGA ACACACACTA
CGGTCATACG GAAACCGGAC CATAAAGTAC CCATATCCGG TCGCTATTTC CTGCCATGCT
ACGGCGGGTG GCGGTATGGA ATACCCCATG ATTTCATTCA ACGGCGGTCG CCCTGAACCT
GATGGCACTT ATTCGGAGCA GACCAAAGCC GGTATGATCG GCGTAATCAT CCACGAAGTG
GGACACAACT TCTTCCCCAT GATCGTCAAC TCCGACGAAC GGCAGTGGAC CTGGATGGAC
GAAGGACTGA ATACCTTCTG CCAGTATCTG GCAGAAAAAG AATGGGATTA CAACTTCCCG
AGTCGCCGGG GTGAGCCGCA GTACATTGTC GACTACATGA AGTCGGATAA AGCCGTACTG
TCGCCCATCA TGACCACATC GGACAACGTA ATCAATCTGG GCGCCAACGC CTATGCCAAA
CCGGCTACCG CACTGAACAT CCTGCGCGAA ACGGTTATGG GCCGTGAGTT GTTCGACTAC
GCGTTTAAGG AATACGCTCG TCGGTGGGCC TTTAAAACTC CCGAACCAGC GGACTTTTTC
CGTACACTGG AAGATGCCTC CGGCGTTGAC CTCGACTGGT TCTGGAAAGG CTGGTTCTAC
GGCGTCGAAC CCGTCGATCA GGATTTAGTC GAAGTAGACT GGTTCCAGGT TGATTCGGGC
AATCCGGAGG TGACGAAAGC CGCTGCCCGG GCCGAAGCCA AGCGCCGGGC CGGTACCATT
AGCAAGCAGC GCGATGCCGC TACGCAGGGC GAAACGGTCG TAGCGAAAGA TTCGACCATG
AAGGATTTCT ACAACAGCTA CGACCCCTAT GCTGTTACAG AGGCCGACAA GAAGAAATAC
CAGGATTACC TGGCGACCCT GAGCCCCGAC GAGCGGAAAA CACTGGAGAC CAACGCTGCG
ACCAACTTCT ACACCCTCTC ACTGAAAAAC AAAGGGGGCG TTCCGATGCC GGTAATTATT
CGGATGGAGT TCGAAGATGG TACCGACTCG GTAGCTCGTT TCCCGGCCGA AATCTGGCGC
TTCAACGATG TGTCGATCAA GAAAGTCATT GCGACGCCCA AGAAAGTGAA GCAATGGGTG
CTGGACCCTT TTCAGGAGAT TGCTGACATC GATACGGAGA ACAACGCTTT TCCACGAATG
GCACAACCAA CGCGGTTCCA ATTGTTCAAG CAACAGCAGC GTTTCGGTCC GCAGGGCCCG
AACCCCATGC AGCAGCAGAA GCGGGCTAAC CAACCCCCAG CTGTGCAGGG TAGTGGCAAG
AATTAA
 
Protein sequence
MRKLLLMASL SLWSAALMAQ APAAPTQNAN SRFEQLGPML PTPNTFRTAS GAPGKDYFQN 
RADYDIKVEL DDANQKIIGT ETVTYHNNSS DELPFIWLQL DQNLFAKGST GSVTRTGAVN
ESGMSFAQLQ NLTSVRERSS QQASDKFGYH ITAVKDAKTG KALQYTINQT MMRIDLPAAI
KPGGSYSFNV DWNYFVTEYY GRSGMEYFAK DGNYNYFIAH WFPRLCAYSD VTGWQNKQFL
GQGEFTLIFG NYKVAITAPS DHVVGATGEC QNYKQVLTAT QQKRMAQAAT SKTPVVIVTQ
DEAEAALKTK PTDKVGKKTW VYAAQNVRDF AFTSSRRFIW DAMQTDVYGD GHKIWSMSFY
AKEGNPLWGQ YSTRVVEHTL RSYGNRTIKY PYPVAISCHA TAGGGMEYPM ISFNGGRPEP
DGTYSEQTKA GMIGVIIHEV GHNFFPMIVN SDERQWTWMD EGLNTFCQYL AEKEWDYNFP
SRRGEPQYIV DYMKSDKAVL SPIMTTSDNV INLGANAYAK PATALNILRE TVMGRELFDY
AFKEYARRWA FKTPEPADFF RTLEDASGVD LDWFWKGWFY GVEPVDQDLV EVDWFQVDSG
NPEVTKAAAR AEAKRRAGTI SKQRDAATQG ETVVAKDSTM KDFYNSYDPY AVTEADKKKY
QDYLATLSPD ERKTLETNAA TNFYTLSLKN KGGVPMPVII RMEFEDGTDS VARFPAEIWR
FNDVSIKKVI ATPKKVKQWV LDPFQEIADI DTENNAFPRM AQPTRFQLFK QQQRFGPQGP
NPMQQQKRAN QPPAVQGSGK N
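
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch, with hypothetical helper names, of one way to strip the block formatting and compute a GC percentage such as the GC content field in the Gene Information table. It only demonstrates the approach on the first line of the gene sequence; the full sequence would have to be pasted in to reproduce the reported figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as printed (six blocks of ten bases).
first_line = "ATGAGAAAAC TCCTGTTAAT GGCGAGCCTG TCCCTCTGGT CGGCCGCCCT GATGGCCCAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Splitting on any whitespace handles both the spaces between blocks and the line breaks between rows, so the same helper works on the whole listing.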