Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4912 |
Symbol | |
ID | 5707400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5578662 |
End bp | 5580002 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641274306 |
Product | polymorphic outer membrane protein |
Protein accession | YP_001539651 |
Protein GI | 159040398 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0139202 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGTC CGCACCACGA ACTCGAGCGG CCCGGCGCAC GCCGCACCAG ATCCCGATGG TGGACTGTCG GGCTGGCCGG CCTGGCTCTG ACCACCACCG TGGGGGTCTC CGTCGCCCCG GCCGCTGACG CTGTCGCGCG CGCCGCCACC GACGACCGGC CCCCTACCAG CGACCACCGC GACGACAACC CAAGCGACAA GGGCAAGGGG AAGAAGAAAC CAAAGGGCGT CCCCGTCCCG TGTACGGCGG ACGCCCTGAT CGCCGCGATC ACCCAGGCCA ACGCCCGCGG CGGAGCCGTG CTCGACCTCG CCACAAAGTG CACCTACACC CTCACCACGA ACATCGACGG CGCCGGCCTG CCCGCGATCG CCACCCCCAT CACCCTCAAC GGCGGCACAC GCACCACCAT CACCCGCGCC GCCGCCGCCG ACCAATTCAG AATCCTCACC GTCAACGCCG GCGGCAACCT CACCCTCAAT CACCTGAAGA TCACCGGCGG ACAAACCACC GAAAACGGCG GAGGTGTCAG CGTCGCCGCC GGCGGCTCCG CCGCCATCAA CCACAGCAAG ATCCTGGACA ACATCACCAG CCAGGGCGGC GGCGGCATCT TCAACCAGGG AACCGCCTTC GTCACCGGCT CGAGCATCAA CCGGAACATC GCTGGTGAAA CAGGGGGTGG CATTTCCAGC AGCGGCCCAC TGAAAGTCGT CAAGTCCCAT ATAGATCGGA ACACCGCAAA CGTGGGCGGC GGTGTCGCTG CCGCCGGCGC TACCGTCAGG GAGGGAAGCA TAACCGGAAA CAAGGCAGCC CTCAGCGCCG GTGGCATGCT TATCGATGGC GGCATCGGCA CAGTGGTCGG CACCCACGTC ACCGGGAACA CCGCCGGCGA CGTCGTTGGC GGCATCGCGG TCACGGACGG CGGACAGCTG ACATTGCGGC GCGTCGGCCT CACCGACAAC ACCGCCGGCA ACGTCGGAGG CTTGTTCGTC TCCGGACCCA TCAGTAACGG CACTTCCTTT GCTGTCATCG AGGACAGCCT TATCAAGAAC AACAATGCCG GTTCCGACGG CGGCGGTATC TCCAATCAGG GGGTCACTAC CCTACGGCGC GCCACAGTCG CCGGCAACGA GGCAGGAGAC CAAGGCGGCG GCATCTACAA CACCCTCGGC GGCACCGTAC GCCTGTTCGA TACAAAGGTG GTCAAGAATG TTGCCGTCAC TGATGGTGGT GGCATCTTCA ACGACGGCGG CACGGTCGAG CTGAACATCG CCAGTGGCAC CGTTGTGGTC AAGAACCGGC CGGACAACTG CTCCGGCGAC GTTCCCGGCT GCGCTGGATA G
|
Protein sequence | MNRPHHELER PGARRTRSRW WTVGLAGLAL TTTVGVSVAP AADAVARAAT DDRPPTSDHR DDNPSDKGKG KKKPKGVPVP CTADALIAAI TQANARGGAV LDLATKCTYT LTTNIDGAGL PAIATPITLN GGTRTTITRA AAADQFRILT VNAGGNLTLN HLKITGGQTT ENGGGVSVAA GGSAAINHSK ILDNITSQGG GGIFNQGTAF VTGSSINRNI AGETGGGISS SGPLKVVKSH IDRNTANVGG GVAAAGATVR EGSITGNKAA LSAGGMLIDG GIGTVVGTHV TGNTAGDVVG GIAVTDGGQL TLRRVGLTDN TAGNVGGLFV SGPISNGTSF AVIEDSLIKN NNAGSDGGGI SNQGVTTLRR ATVAGNEAGD QGGGIYNTLG GTVRLFDTKV VKNVAVTDGG GIFNDGGTVE LNIASGTVVV KNRPDNCSGD VPGCAG
|
| |