Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shel_03900 |
Symbol | |
ID | 8394282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | + |
Start bp | 461350 |
End bp | 464346 |
Gene Length | 2997 bp |
Protein Length | 998 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 644985154 |
Product | hypothetical protein |
Protein accession | YP_003142800 |
Protein GI | 257063128 |
COG category | [S] Function unknown |
COG ID | [COG4951] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.114045 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.665551 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACG GGATATCGAA ATACGCGTTG ATAAACGGTG CGGAAGCCTC ACTCATCGAG GCTATTCCCC ATGCCCTGAC GAATGATGCG GTCTGCGTTT TCGAAGATGC AGCAGGCAAG CGTCGGTACG TCTCGAAGGC CGAGTGGGTT GCGGGGGCTG ATGAGTTCAT CCATTATGCG GCGGAAAAAC GGCTGGTTAC CTCAGAAAGT CCTACCAGGG ACAAGATCGT CCTCTTCAAA TCGCTGTTCA AGGGCCGCGA AGATGTGTAC GCGCATGGCT TTGTCAAGCG AGAAGGCGGT ATCGGCTATG CTCCGAACTG CGCCAATGAA AGAACGCGCC GATGCCCGCG ATGGACGAAG GCCAACCCTG GCGTGAAGTG CACCGACTGC CCTGCGCGCC AGTTCATTCC CGTTAATGAC CGGGCAATCG TTGACCATCT CAAAGGCGAT CACGACGACC TGCGTGACGT GATGGGTCTT TATGTGCTGA CGCCGGATTG CAAGACATGG GTGCTCGTGG CCGACTTCGA CGAAGGCGGG TGGCAGCGCG AAACGGCGCT GTACTGCGAA TCTTGTCGCC AGTTCGGTCT GTTTCCAGCA GTCGAACGCT CTCGTTCTGG AAACGGTGCG CACGTTTGGC TGTTCTTCGA AGAGCCCATC GACGCCGAGC TGGCCAGAAG CCTTGGAGGC GTAATCATCA CGCACGCCAT GGACGAAGCG CCAGGCATGA CTTTCGAGTC GTACGACCGA TTCTTCCCAA CGCAATCCAC CATTCCACAA GGCGGTTTCG GCAACCTGAT TGCGTTGCCG TTGCAGGGTC GGGCGCAGCG ACAGTCGAAC TCCGTGTTCG TGGACGAGCG CTTCGACGCG TTTCAGGATC AGTGGCGGTT CTTGTCGTCC GTTTCGAAAG TGTCTGCCCA AAAGGTTCAG GAAATCGTCG GTTCCGCAGC TGACGGACCG TTAGGGCAGC TTGCTTTCGC AAGCGCTCCG TCTCGAAAGG CGAGTGTATC GGCGGGCCGC GAGGTCTCCG GTTTGTCGGA GCCCGCCACA AAGCCCGCAC CTGGCACCTT CCCCAAAACC TTGGACGTGA CCAGGGCGAA CATGCTCTAT GTGGCTAAAG AGGGTCTTTC CCAGAATGCG CTCAACAGAA TCAGGCGTCT GGCTGCGTTT GGCAATCCCG AGTTCTATCG GGCGCAGGCC ATGCATCAAT CCGTTTATGG AAAGCGCCGA ATCGTATGGT GCGGCGAAGA GGACGAGGCG TACATCATGC TGCCACGCGG ATGCGAGCAA AGACTCATCA GATTGCTGAG CGAGCATGGG TGCGTATGCC ATTTCGACGA CAAGCGCACC GATGGCGTGC CGATTGGCGC GACGTTTTCC GGCGTTTTGC GCGACCGGCA GCAACAGGCG GTGGATGCGT TGCTGCGCCA TGAGAGCGGC ATCCTGATGG CCCCGACGGG TTTCGGGAAG ACGGTGATCG GGGCATGCAT CATCGGTAAG CTGAAGATGC GCACCCTGGT CATCGTCCCG AAAACGAACC TTATCGACCA GTGGAAAACA CGGCTCGAAC AGTTCTTGGT TATCGAGGAC AATCGGCCTG CGCTGCTTAC GAAATCAGGC CGCCCAAGCA GGAGGAAGCG ACCCGTCATC GGTCAAATTG GTGGAGGGAA GAACGCGCCG AGCGGCATCG TGGACATTGC CACGTTCCAA TCTCTTTCCT TTAAGGACGA CTTGGGAATT CCGAGCGCAA AGCCCATCGT CGAGGATTAC GGGCTCGTCA TCTGCGACGA GTGCCATTAT GGGGCCGCTC CCAACCTGGA ATTGGTCATG AAGAACGTGA CAGCCAAGTA CGTCTATGGG CTGTCCGCCA CCCCAAAAAG AGCTGACGGT CTGGAGCGCA TCATCTATAT GCATTGCGGT CCCATTCGGC ATAAGGTCGA CCCGAAGGAG CAGGCTGCCG AGCAGGGGTT CGTGCGCACC CTGCAGCCGC GATTCACCCG TGTGAGGCTG GCATCGCTGG AGCCTGGTTT TTCGTTCAAC CAGGTCGTCG ATGCGCTTTG CGAGCACGCC GCTCGAAACG ACCTGATTGC GGAAGACGCG GCGGGTGCGG TGCGGGCTGG CCGAACGCCG CTTGTTATTA CAAAACGTAA AGAACACGCG TCTGAACTCG CTAAACGTTT GGAAGAAGCC GGTGTTACGA CCTATGTGCT TACGGGTGAG GGGACCGCAC GGGAAAAGCG CGACCGAATC GAGCGGGTGC GTAATGCGAC GGGCTCCGAT TATGCGATTG TCGCGACTGG CAGCTATATC GGTGAAGGTT TCGACTTGCC CCAGCTCGAC ACGTTGATGC TAGCGTCGCC CTATTCCCAT GAAGGGGTGA TTACACAGTA TTCAGGTCGT TTGCACCGTG AGAGCGAAGG CAAAACCGAC GTGATTGTGT ACGACTACGT CGACACCAGT GTGCCTATGC TTGAGCGCAT GTACAAACGA CGACTCAAGA CCTATGCGAA GCTTGGATAC ACGATTAAGG ATGCGACGGA GTTGCAAGGG CCTGGTGCCC GCATCGTGAC GGCCGAGTCC TGGCGCATGG ATTTTCTCGC GGACTTGTCC CAGGCGAGCA GGCGTGTCGT GATTTCGACG CCGTATGCGA ATCCGAATCT AGTGGACTCC ATGATGGCTG ACTTTAAGGA CGCGCTTGCC AGGGGCGTCG AAGTCGAGGT CGTGATGAGA AAGGCGAAGT CAGTTGCTTC TGTTGAGCTG CAGACGCGGA TTTCGGAGGG ACTTTCTTCC GCGGGGTGCA CGGTCACCGT TGAGGATGCG CCAGTTACCG GCGTTGCCAT TTTCGACGGC AAGGTGTCTT GGTATGGAAC GCTGCCGTTG CTGGCCTTTC CCAAAAACGA TGACTGCAGC TTGAGGGTCG ACAGCGCCGA AGTCGCCGCC GACCTGGCAG GGGCCGTCGG AATCCAGTCG GAACATGCCG ATATTGCGAC GGGGTGA
|
Protein sequence | MTHGISKYAL INGAEASLIE AIPHALTNDA VCVFEDAAGK RRYVSKAEWV AGADEFIHYA AEKRLVTSES PTRDKIVLFK SLFKGREDVY AHGFVKREGG IGYAPNCANE RTRRCPRWTK ANPGVKCTDC PARQFIPVND RAIVDHLKGD HDDLRDVMGL YVLTPDCKTW VLVADFDEGG WQRETALYCE SCRQFGLFPA VERSRSGNGA HVWLFFEEPI DAELARSLGG VIITHAMDEA PGMTFESYDR FFPTQSTIPQ GGFGNLIALP LQGRAQRQSN SVFVDERFDA FQDQWRFLSS VSKVSAQKVQ EIVGSAADGP LGQLAFASAP SRKASVSAGR EVSGLSEPAT KPAPGTFPKT LDVTRANMLY VAKEGLSQNA LNRIRRLAAF GNPEFYRAQA MHQSVYGKRR IVWCGEEDEA YIMLPRGCEQ RLIRLLSEHG CVCHFDDKRT DGVPIGATFS GVLRDRQQQA VDALLRHESG ILMAPTGFGK TVIGACIIGK LKMRTLVIVP KTNLIDQWKT RLEQFLVIED NRPALLTKSG RPSRRKRPVI GQIGGGKNAP SGIVDIATFQ SLSFKDDLGI PSAKPIVEDY GLVICDECHY GAAPNLELVM KNVTAKYVYG LSATPKRADG LERIIYMHCG PIRHKVDPKE QAAEQGFVRT LQPRFTRVRL ASLEPGFSFN QVVDALCEHA ARNDLIAEDA AGAVRAGRTP LVITKRKEHA SELAKRLEEA GVTTYVLTGE GTAREKRDRI ERVRNATGSD YAIVATGSYI GEGFDLPQLD TLMLASPYSH EGVITQYSGR LHRESEGKTD VIVYDYVDTS VPMLERMYKR RLKTYAKLGY TIKDATELQG PGARIVTAES WRMDFLADLS QASRRVVIST PYANPNLVDS MMADFKDALA RGVEVEVVMR KAKSVASVEL QTRISEGLSS AGCTVTVEDA PVTGVAIFDG KVSWYGTLPL LAFPKNDDCS LRVDSAEVAA DLAGAVGIQS EHADIATG
|
| |