Gene Slin_4920 details


Gene Information

Locus tag: Slin_4920
Symbol:
ID: 8728684
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 5990770
End bp: 5992290
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 50%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003389697
Protein GI: 284039767
COG category:
COG ID:
TIGRFAM ID:
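
The coordinates and lengths listed above agree with one another, which a little arithmetic confirms. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the terminal stop codon counts toward the gene length but not the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5990770
end_bp = 5992290

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values match the reported Gene Length (1521 bp) and Protein Length (506 aa).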


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.404892
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.563369
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTTT TGTTGGTATG GATTGCGTAT GGGGCTCTTT TTTGCTTTGA TAGCGAGTTA 
GTCTATGGAC AACATGTTAA TCAGTATCAG CTCGAAGTAA GCGGACTGGG TTCGTCCGAT
CAGACGCCCT TCTGGTTACG AGCCAATCAA TATGGTACGG TTCCGTTAAC AGGCCCGGCT
CTTCGACTAA ATGCGGGCTT ACATGCTGAT TACCGTCCTG CCGACAGTAC CGGCCATCGT
CCAAAAGCCG ATTGGGGGTA TGGCGTCAGC GTCGTGGCTA ATGTAGGGTC AACAAGCCAG
TTTCTCCTTC CCGAAGCCTA CATTAAAGGG CGGGTTGGTG CGTTTGAGCT TTACGTCGGC
CGTCGCAAAG AAATTATTGG GCTCGTGGAT ACACTGCTGA CGAGTGGGTC CTATATCTGG
TCGGGTAATG CGCTGCCTTT TCCTAAGATT CAACTGGCTG TACCGGTCTT TACATCCATC
CCGTTCACCA AAGGAGTGCT TTCTGTCATG GGCACTTTTT CGCACGGGTG GTTCGAGAAT
GGCGACCGAT TGGTGAAAGA TTCGTATCTC CATCAGTCTT CGGTTTATGG ACGTTTGGGT
AAACCGTCGT GGCGGGTTCG TTTCTACGGC GGATTCAATC ATCAGGTCAT GTGGGCAGGT
CATTCCGAAT TTATAGATCC TACCGTGGCA GCCAATGGTA AACTACCCTC AAACATCAAG
TACTATCCGG CAGTCGTATT AGGTACCCGA AATCCCTTCC CCGACGACCA GGCCATTCAA
ACGATAAGCC ATTTTGAAGA AAACAGGATT GGCAATCACC TGGGCTCTAT CGACTTTGCA
GCTGAGGTTA ACCTGAACCA CTGGAACCTG TTTGCCTATC GGCAATTTAT GTACGACGAT
GGCTCTCTAT TTTATGGTAC GAACCTGGAC GATGGTCTGA ATGGCCTCCG CATCCGAAAC
CGGGATCAAC TAACCGGGGC TGCTTTTTTT CTGAAGCAGA TTACAGTTGA GTATATGTTT
ACCGGGAGTC AGGGCGGTGA TTTGTTTATC CTTGACGATC CGCAAAAACG GGGGCGGGAT
GATTATTTCT CGAATAGCCA GTACCTCGAT GGCTGGACTT ATTTTGGCCG AACAATTGGC
ACGCCATTTA TTACCCCGCA AAGCGAAGTG CGGTCGTCCT TACCGCCCCG GTTTGGTATT
GCCAACAACC GGGTGAGCCT TTTCCACGTT GGGGTAAGTG CGTTGGTGTT GAATAAGGTC
GATATAACGA CACGTTTGTC ATTTAGCCGT AATGCAGGCT CTTATCCCAT TCCTTATTTA
ACGATACCAG CCCAGTTTTC AGGATTGTTT ACGGCGTCGG TTCCTATCGG TTTATTTGGA
GGAACCACCC TGAATGGGTC AATCGCGGTC GATTCGGGCG GGTTATTACC CAATAGCGTG
GGTACTTACG TGGCTTTGCG GAAAACCGGG CTGCTCGGTG GGAGCCGGCG CGCACCCGTT
CCCATACGTA GCGCCTATTA G
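
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any computation. As a minimal sketch, the Python below strips the whitespace and computes the G+C fraction. The helper names `clean_blocks` and `gc_fraction` are hypothetical, and only the first row of the sequence is included to keep the example short, so its result describes that row only and not the 50% GC content reported for the full gene.

```python
def clean_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGAAATTTT TGTTGGTATG GATTGCGTAT GGGGCTCTTT TTTGCTTTGA TAGCGAGTTA"
seq = clean_blocks(first_row)
print(len(seq), round(gc_fraction(seq), 3))
```

Pasting the full blocked sequence into `clean_blocks` in place of `first_row` would give the value for the whole gene.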
 
Protein sequence
MKFLLVWIAY GALFCFDSEL VYGQHVNQYQ LEVSGLGSSD QTPFWLRANQ YGTVPLTGPA 
LRLNAGLHAD YRPADSTGHR PKADWGYGVS VVANVGSTSQ FLLPEAYIKG RVGAFELYVG
RRKEIIGLVD TLLTSGSYIW SGNALPFPKI QLAVPVFTSI PFTKGVLSVM GTFSHGWFEN
GDRLVKDSYL HQSSVYGRLG KPSWRVRFYG GFNHQVMWAG HSEFIDPTVA ANGKLPSNIK
YYPAVVLGTR NPFPDDQAIQ TISHFEENRI GNHLGSIDFA AEVNLNHWNL FAYRQFMYDD
GSLFYGTNLD DGLNGLRIRN RDQLTGAAFF LKQITVEYMF TGSQGGDLFI LDDPQKRGRD
DYFSNSQYLD GWTYFGRTIG TPFITPQSEV RSSLPPRFGI ANNRVSLFHV GVSALVLNKV
DITTRLSFSR NAGSYPIPYL TIPAQFSGLF TASVPIGLFG GTTLNGSIAV DSGGLLPNSV
GTYVALRKTG LLGGSRRAPV PIRSAY
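
The protein sequence can be cross-checked against the gene sequence by translating codons. The sketch below is a hedged illustration rather than the method used by the database: it builds the standard codon table, whose codon-to-amino-acid assignments are the same in translation table 11 (table 11 differs in which codons may start translation), and translates the first 30 and last 21 bases of the gene. The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: these assignments also apply under translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start and end of the gene sequence, copied from the record.
gene_start = "ATGAAATTTT TGTTGGTATG GATTGCGTAT"
gene_end = "CCCATACGTA GCGCCTATTA G"

print(translate(gene_start))
print(translate(gene_end))
```

The two translated fragments can be compared with the first ten and last six residues of the protein sequence above.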