Gene Slin_4719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4719 
Symbol 
ID8728483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5744295 
End bp5745464 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content52% 
IMG OID 
Productsignal peptidase I 
Protein accessionYP_003389496 
Protein GI284039566 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.898955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTAAAA AGGAAGCAAA GGCTGGCGCC ACATCGGCCA GGCCGAAGAA ATCTGCACTT 
CGGGAGTGGC TCGATTCGGT GCTTTTTGCG GTGATAGCCG CCACGTTGAT TCGTTTTTTA
ACGTTCGAGG CTTACGCCAT ACCGACCCCT TCGATGGAAA ATAGCCTCAT GGTTGGTGAT
TTTCTGTTTG TCAGTAAGCT TCATTATGGG ATTAGGACGC CAAAAACGCC CCTACAGGTG
CCACTGACCC ATCAGAAAAT CTGGGGTACG GAGATACCAT CGTACAGCAC GGCTATTCAA
CTGCCTATCT ACCGGCTGCC GGGTTTTACA ACCGTCAAAA ATGGCGATGT GGTGGTGTTC
AATTATCCGC CCCCCAAGGC AAACGAGCCC GCTTACCCAA CCGATCTGAA GACGAATTTC
ATTAAACGAT GCATCGGTAT TCCGGGCGAT GTGCTGACCA TCCGCCAGAC GCAGGTGTTT
GTAAATGGCA AACTATTGCC CGCGCCCGCC CGCTCCGAAA CGACCTACTT TGTCAAAACC
AGCGAAGTGC TCGACGACCG GTTCTTCCGG AAATACGACA TTGTCAATGA TTTCAAATCA
TCCGAGGGGC CCTTCATCAA CTGGCAGCCG CTGGAAGCCT ACAACGAGCA GACCAAACAA
TCGGTGCAGG TGGGTTACCG GGTAAATATG ACGGAAGAGG TCATGCAAAA ATTCAAATCG
TTCGACTGGG TGAAATCAGT AGAGCCGATG ATCGAACCGG CCGGTCAGGG TGGCCCCGGC
ATCATGGGCA GCGCGGCTTA CACCTGGAAT CAGGATAATT TCGGGCCGCT TACCGTACCA
AAAAAAGGAA TGACCATACC TGTTAATAAG CAAACGATTG CCGTGTATGG CGACATCATC
AAACGCTACG AAGACAACCG CGTAGTGGAC GTAACGCCAA CGGGTATCCG GGTAGACGGT
CAGCCGATTA CGACCTATAC CTTTAAACAG GATTATTATT TTATGATGGG CGACAACCGG
CACAACTCCG AAGATTCCCG CTACTGGGGC TTTGTGCCGG AAGATCATAT CGTGGGCAAA
GCCGTATTCG TCTGGATGTC ACTCGACCCC GTCCCGACCG ATATCTGGCA TAAAATCCGC
TGGAACCGGC TGTTTCGGTT GATTGGGTAA
 
Protein sequence
MGKKEAKAGA TSARPKKSAL REWLDSVLFA VIAATLIRFL TFEAYAIPTP SMENSLMVGD 
FLFVSKLHYG IRTPKTPLQV PLTHQKIWGT EIPSYSTAIQ LPIYRLPGFT TVKNGDVVVF
NYPPPKANEP AYPTDLKTNF IKRCIGIPGD VLTIRQTQVF VNGKLLPAPA RSETTYFVKT
SEVLDDRFFR KYDIVNDFKS SEGPFINWQP LEAYNEQTKQ SVQVGYRVNM TEEVMQKFKS
FDWVKSVEPM IEPAGQGGPG IMGSAAYTWN QDNFGPLTVP KKGMTIPVNK QTIAVYGDII
KRYEDNRVVD VTPTGIRVDG QPITTYTFKQ DYYFMMGDNR HNSEDSRYWG FVPEDHIVGK
AVFVWMSLDP VPTDIWHKIR WNRLFRLIG