Gene Information |
Locus tag | Sare_4398 |
Symbol | |
ID | 5703447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4969987 |
End bp | 4971813 |
Gene Length | 1827 bp |
Protein Length | 608 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641273817 |
Product | hypothetical protein |
Protein accession | YP_001539166 |
Protein GI | 159039913 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
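The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS is complete and ends with a single stop codon. Neither assumption is stated in the record itself, and the function names are hypothetical, chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 4969987
END_BP = 4971813


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming a complete CDS that ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


gene_len = gene_length(START_BP, END_BP)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)
```

Under these assumptions the computed values agree with the Gene Length (1827 bp) and Protein Length (608 aa) fields listed in the record.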
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.5486 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0412393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACG ACCATCCCGC CGGACCCTCG GGCCGGCACC TCGGCCGGCA CGGGCCGGAC GGCGGGGGCC CGGGCGAGGC GGTGCCCGGC TCCGGCCCGG CGCCCACGTC ATCGCCGAAC GGCCCCCATG CCAGTGAGGC GGCCGGCGGC CGGCCGGCGT GCCGGCCCCG CAGCACCGAC CCGGCCGAGT TGGGCTTCAC CCCACGCAAA CCGGTCCCGT GGCTGGCGCC GTTCCTGCTG GTCAGCACCG GCATCCGTAC GCTGCTCGCG CTGCTCTTCG GCGCCTACCT GGACAAACGA GAGCTACAGA CCGCCTTCGA CGCGAAGATC AGCCGGCAGG TTGGGCCAGA CGGTGGTGTC TGGCTGGACT ACGTCGCCGA CCTTGGTGAC GGCTTCGACG CCACCTACTC GGTCGCGTAC CTGCTGGCCC AGCGGGAGCT GATGGTCGAA GGACACCGGC TGCCCCGGGC GCAGGTGCTG GTGATGGGCG GCGACCAGGT CTATCCGTCG GCGGCCTTCG ACACATACGA GGACCGGTGC AAGGGCCCCT ACCAGGCAGC GCTGCCAGTC ACTCCACCCG AGCAGCCGAC GTTGTTCGCG ATCCCCGGCA ACCACGACTG GTACGACGGT CTCACCGCCT TCCTGCGGCT CTTCGTCCGG TCCCGGGACC GGCACTTCGG CGGCTGGAAC ACCGAGCAGT CCCGGTCGTA CTTCGCGGTG GAACTGCCGG CGGACTGGTG GCTGTTCGGC CTGGACGACC AGTCGGGTTC GTACCTGGAT GACCCACAGC TCACCTACTT CGACGACGTG GCCGAGCGGC TGGGGCCACA GAGTCGGGTG ATCCTGGCGG TGCCGATGCC GACCTGGGTC AAGGCCACCA AACACCCGAC GGCGTACGAC TCGATCGACT ACTTCATCCG CACCATCGTC GCGCCGACAG GGGCGCAGGT GCGGCTCCTC ATCTCCGGTG ACCTGCACCA CTATGCCCGG TACGCGGGGC CGGACCGTCA GCTGATCACC TGTGGTGGCG GCGGTGCGTA CCTCTACCCG ACGCACCTGT TGCCGGAGCG GATCCAGGTC CCACCGAAGG AGACGCTGGC CCGGCGGGCG AGTGCCACGC AGGTGTACGA GTTGGCGGGG CGATACCCCG ACGTGGCGCG GTCCCGGCGG TACGCCTGGG GCGCCTTTCT GCGGCTGCCG TTACGTAACC CGGGTTTCAC CACGCTGCTC GGTGCCCTGT ACGCGCTGCT GGTCCTGGCG ATGGTCGGGG TCTGCACGAA CCGCGATGAC GCCCAGCTGC GACTGTTCAG CGTTCCGTTG GCGGCGATGC TGCTGGTGAC CCTGCTCGGG GCGTTCTTCT TCGCCAAGCC GCCCGGTTCC GCGGGCAAGC GACGCCTTCG GCACTGGCTC CTCGGCGTGG GGCACGGTCT GGCGCACGTG GCGTTGGCGG CAGGCGGCAC GTGGGTGTGG CTGGCACTGC CGTTCCACGA CTGGCCGTGG CCGCTGTCGG TGGTCGCCGC GGTCGTGTTC TTCGGGTCGG TGGGCGGCCT GGCAGCAAGC CAGCTGGTGG CGGCGTACCT GCTGGTGGCC GGCGCGTTCG GGGTCAACGT CAACGAACTC TTCGCCGGTC AGGGCATTGA GGACGCGAAG GGTTTCCTGC GTATGCACAT CGCCCCGGAG GGGACGCTGA CGATCTACCC GATCGGGCTC GACCGGGTGG GTCGCCACTG GCAGGTCAAC CCCGACCTCT CCGCCGAGTC GTCGTGGCTG GTCCCGGGCA TCCCGCTGGA GCCTCGCCTG 
GCCGAGCCCC CGCTGGTCCT CCGCTGA
|
Protein sequence | MTDDHPAGPS GRHLGRHGPD GGGPGEAVPG SGPAPTSSPN GPHASEAAGG RPACRPRSTD PAELGFTPRK PVPWLAPFLL VSTGIRTLLA LLFGAYLDKR ELQTAFDAKI SRQVGPDGGV WLDYVADLGD GFDATYSVAY LLAQRELMVE GHRLPRAQVL VMGGDQVYPS AAFDTYEDRC KGPYQAALPV TPPEQPTLFA IPGNHDWYDG LTAFLRLFVR SRDRHFGGWN TEQSRSYFAV ELPADWWLFG LDDQSGSYLD DPQLTYFDDV AERLGPQSRV ILAVPMPTWV KATKHPTAYD SIDYFIRTIV APTGAQVRLL ISGDLHHYAR YAGPDRQLIT CGGGGAYLYP THLLPERIQV PPKETLARRA SATQVYELAG RYPDVARSRR YAWGAFLRLP LRNPGFTTLL GALYALLVLA MVGVCTNRDD AQLRLFSVPL AAMLLVTLLG AFFFAKPPGS AGKRRLRHWL LGVGHGLAHV ALAAGGTWVW LALPFHDWPW PLSVVAAVVF FGSVGGLAAS QLVAAYLLVA GAFGVNVNEL FAGQGIEDAK GFLRMHIAPE GTLTIYPIGL DRVGRHWQVN PDLSAESSWL VPGIPLEPRL AEPPLVLR
|
| |