Gene Slin_4506 details


Gene Information

Locus tag: Slin_4506
Symbol:
ID: 8728270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5463629
End bp: 5464939
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003389285
Protein GI: 284039355
COG category:
COG ID:
TIGRFAM ID:
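
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only illustrative. It assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself.

```python
# Values copied from the record above.
start_bp = 5463629
end_bp = 5464939
reported_gene_length = 1311      # bp
reported_protein_length = 436    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1311 0 436
```

Under those assumptions the computed values agree with the reported gene length (1311 bp) and protein length (436 aa).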


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA TCATAATTCC CATACTTCTC ACGGGCCTCA TGTTGACGCA GGTTGCCTGT 
AATAAAGAAT ACCTGAACCC GAGTACCATC AGCCAGCCGC AGGCGGTCAG TTCGCCCGAT
GGGTTGATGA CGCTCGCCAA TGGACTGCAG TACCGGTATA CCACCGGGGG GGCGTTGAGC
GTACTGTATG CCAGCACCGC CGGAGCGGGC CTGACCACCC GCGAGCAGGC CGTTCTGAAC
GTGGGGAACC TCGACGAGGC TAACCTCGCC GTTGGTGCCA GTACGGTAAG CAACATCAAC
AGTGTGGTAC GTAATCTGTG GACACAAAGC CACCTGGTAC GTTCCAACGC CGAACTGATT
CTGGCTAATA CGGGTGTTGT TTCGGATGCC GGTACGAAAA GCGGAATAGT TGGCTATGCG
TCCATTTTCC GGGCCCTTTC GCTGGGTACA CTCGGCATGT TTTTTGAGCA GTCGCCGGTA
ACGACGGGAA CGAACGTCGC GTTTGTTCCT CGGGTGCAAG TGTTCAAAGA AGCGGTTTCA
ACGCTCGAAA CAGCGGCCAC CCAACTAGCC AGTGCACCGG TATCGGCTGA CTTCACGGGC
AAAGTTGTTC CGGGGATCGA TATCGCAAAC ACGATTCAGG CCCTGATTGC CCGGTATTCT
CTGCTTTCGG GAGATTACGA CAAGGCGCTG GCTGCTGCGG CCAAAGTTGA CCTGACCAAA
CGATCTGTCT ACAACTTCGA CGATAACACC CGGAATCCGC TGTTTGAGTA CACGTTTGGC
AACCTGAACG TGTTTCAGCC CACTAACGCC AATCTGGGTC TTCCAGCCGC GCTGGCTCCG
GATGCTGCCG ATAAGCGAAT TGCGTTCCTG ACAAAGCCCA GCACCAACAC GGCCGTTGCG
CCGGTTATTG CCACCGCTTT CTATACGGCC AATAATGCTG CGGTTCCTGT TTATGTACCG
GGTGAAATTC TGTTGATTCA GGCCGAAGCG TATGCCCGGA AAGGCGATCT GACCAACGCC
GTTGCGGCAC TCAACAAGGT GCTGACGAAA ACAGCCGCTC AGGATGCGTT TGGTTTAGGG
GCAGCTTTAC CAGCCTATTC AGGAGCGCAG ACGGCCGATG CCATTCTGAC GGAGATTTAC
CGAAATCGCC AAATTGAACT GGCCTATCAG GGTTTCCGTC TGGAAGACAG CCGCCGATTC
AATCGGCCGG GGCCGGGTAC TACGGGTGCC GAGCGCAACC GCAATTTCTT CCCCTATCCG
CTGAATGAGC GTAACAACAA CACCAATACG CCACTCGACC CGGGGATTTA G
 
Protein sequence
MKKIIIPILL TGLMLTQVAC NKEYLNPSTI SQPQAVSSPD GLMTLANGLQ YRYTTGGALS 
VLYASTAGAG LTTREQAVLN VGNLDEANLA VGASTVSNIN SVVRNLWTQS HLVRSNAELI
LANTGVVSDA GTKSGIVGYA SIFRALSLGT LGMFFEQSPV TTGTNVAFVP RVQVFKEAVS
TLETAATQLA SAPVSADFTG KVVPGIDIAN TIQALIARYS LLSGDYDKAL AAAAKVDLTK
RSVYNFDDNT RNPLFEYTFG NLNVFQPTNA NLGLPAALAP DAADKRIAFL TKPSTNTAVA
PVIATAFYTA NNAAVPVYVP GEILLIQAEA YARKGDLTNA VAALNKVLTK TAAQDAFGLG
AALPAYSGAQ TADAILTEIY RNRQIELAYQ GFRLEDSRRF NRPGPGTTGA ERNRNFFPYP
LNERNNNTNT PLDPGI
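
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The sketch below is a minimal, hypothetical helper (`translate` and `CODON_TABLE` are illustrative names, not part of the record). Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in its permitted start codons, so the standard assignments are used here. The input is assumed to be upper-case A/C/G/T with the spacing of the display format removed.

```python
from itertools import product

# Codon-to-amino-acid assignments of the standard genetic code,
# listed in T, C, A, G order for each codon position.
# Table 11 shares these assignments; it differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3)
    )


# First seven codons and last four codons of the gene sequence above.
head = translate("ATGAAAAAAA TCATAATTCC C")
tail = translate("CCGGGGATTTAG")
print(head, tail)  # MKKIIIP PGI*
```

The translated fragments match the start (MKKIIIP) and end (PGI) of the listed protein sequence, with the final TAG codon read as a stop.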