Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0216 |
Symbol | |
ID | 5706120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 246774 |
End bp | 248444 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641269745 |
Product | hypothetical protein |
Protein accession | YP_001535142 |
Protein GI | 159035889 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0166769 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0154007 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGACGAA TCGATGATAT TGCGAACCGG TACGTGGCGG ACTGGGCCGC ACTGAACCCA ACCGGTGCCA CCTACGTCGG CATCCCGGGC CACGACGACC GGCTCGACGA TCTCTCGCCG GACGGGTACG CCAGCCGCAC TGACCTGACC CGCCGAGTCC TCGCCGACGT CGACGCCACA GAGCCGACGT CACCGGCGGA GCACACCGCG AAGGAGGCCA TGCAGGAGCG ACTCGGCCTC GAGCTCGCCC GCTACGAGGC GGGAGAGGTG GGTCGCGGAG TCAGTGTCAT CACCAGCGGT CTACACGAAC TCCGGTCCGT GTTCGACCTG ATGCCGACCG GCGGCGAGGG CGACCGGGCC AACATCGCCG CGCGGCTCAA CCGCTTCGCC GAAGCACTCG AGGGATACAA GACCACGTTG CGCGAGGCGA CCGACGCCGG CCAGGCCAGC GCCCAGGCGC AGCTACTCGA GGTGGCCAGG CAGTGTGACG TCTGGGTGGA CCCGGACGGC GACAACTTCT TCCACGGGCT GGTCGAGCGG CTGGACGAGG GGGGCACTCT CGGCGCCGAG CTGCGCCGGG GTGCCACGGC CGCCACCGCG GCGACCGCCG AGTTCGGCCG ATTCCTCCGT ACCGAACTGG CCCCACGGGG TAGGACGAAC CAGGCCGCCG GCCGGGAGCG CTACGAGCTG GCCTCGCAGT ATTTTCTCGG CGCGCGGGTC GATCTCGACG AGACGTACGC CTGGGGGTTC GAGGAGCTGG CCCGGCTCGA GGCGGACATG CGAACGGTGG CCGCGCGGAT CGTCGGTCCC GGCGCCACGG TCGACGAGGC GGTAGCCGCG CTGGACGCGG ATCCGGCGCG GACCATCCAG GGTAAGGAGG CGTTCCGGGA CTGGATGCAG GGCCTCGCGG ACAGGGCGAT CAGCGAGCTG CACGGCACCC ACTTCGACAT TCCGGAGCAG GTCCACCGGA TCGAGTGTTG CCTGGCGCCG ACGAGTGACG GCGCGATCTA CTACACCGGT CCGAGTGAGG ACTTCTCCCG CCCCGGCCGC ATGTGGTGGG CAGTGCCGCA GGGCATCAAC GACTTCTCCA CCTGGCGCGA GGTCACCACC GTCTACCACG AGGGTGTACC CGGCCACCAC CTTCAGGTCG CCCAGACCGC GGTCCGGGCG GAGACCCTGA ACCGCTGGCA ACGGTTGCTC TGCTGGGTCT CCGGGCACGG TGAGGGCTGG GCCCTCTACG CCGAGCGGCT GATGGAGGAA CTGGGTTACC TGGAGGACGC GGGCGAACGG CTGGGCATGC TCGACGGCCA GGCGCTGCGC GCCGCCCGCG TGATCGTCGA CATTGGCATG CACCTGGAGT TGGAGATCCC GACCGACAAC CCGTTCGGCT TCCACCCGGG CGAGCGCTGG ACACCGGAAC TGGGCTGGGA GTTCATGCGG GCGCACTGTC GGATACCGGA TGAGGTCCTG CGCTTCGAGC TGAACCGCTA CCTGGGTTGG CCCGGGCAGG CGCCGTCCTA CAAGGTTGGT GAGCGGATCT GGCTGCAGGC CCGGGCCGAC GCGAAGGCCC GCAAGGGTGC CGACTTCGAC CTCCGGGAGT TCCACCGGCA GGCACTCGAC CTGGGCTCAC TCGGCCTGGA CCCGCTGCGT CGGGCACTCG CCCGAATCTG A
|
Protein sequence | MRRIDDIANR YVADWAALNP TGATYVGIPG HDDRLDDLSP DGYASRTDLT RRVLADVDAT EPTSPAEHTA KEAMQERLGL ELARYEAGEV GRGVSVITSG LHELRSVFDL MPTGGEGDRA NIAARLNRFA EALEGYKTTL REATDAGQAS AQAQLLEVAR QCDVWVDPDG DNFFHGLVER LDEGGTLGAE LRRGATAATA ATAEFGRFLR TELAPRGRTN QAAGRERYEL ASQYFLGARV DLDETYAWGF EELARLEADM RTVAARIVGP GATVDEAVAA LDADPARTIQ GKEAFRDWMQ GLADRAISEL HGTHFDIPEQ VHRIECCLAP TSDGAIYYTG PSEDFSRPGR MWWAVPQGIN DFSTWREVTT VYHEGVPGHH LQVAQTAVRA ETLNRWQRLL CWVSGHGEGW ALYAERLMEE LGYLEDAGER LGMLDGQALR AARVIVDIGM HLELEIPTDN PFGFHPGERW TPELGWEFMR AHCRIPDEVL RFELNRYLGW PGQAPSYKVG ERIWLQARAD AKARKGADFD LREFHRQALD LGSLGLDPLR RALARI
|
| |