Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1397 |
Symbol | |
ID | 5704083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1612919 |
End bp | 1614637 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641270907 |
Product | hypothetical protein |
Protein accession | YP_001536288 |
Protein GI | 159037035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0022466 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATCGAAG TGTCGGCGCA CCGGGTTGGT CAGGTCCTGA TGGTCGGTGC CGAGGCGGCG GTCGCTCGGG CCGCCGCACA CCTCACCCAA CCTCCCGCCG AACCCGGCCG GACCACCGTG CAGGTTGTCG GCGACGACCT ACGTGACTGG GAACAGCTCG CCCCGACCCT CGCCGCGCAC ACGTCGGAGG TGGGCACCTC GGTGCGCCTG GTGTCGCCGC CGGCGATGAA CCCGCCATCG ATAGCGTCCG CGCGTCGGCT GGGCGACCAG CTGGGCATCG AGGTGGTCGC CCCCGACGGG CCGCTCCTAC CGGCCCGCGA CGGCAGCATG TTCGTGCTCG GCCCCGACGC GTCCTGGTGG ATCATCGAGG CAGGCGCGGC GGCGCGGCAG GTCGGGCCGC ACCACCCCAC TCCGTGGTGG GCCGAGCATC GGCCCACCAC GGCCACCGAC TGGGTGACCA TTCCGGCCGG TGGGTGGCTG CCCGGCGGCG AGCGGCCGGA CGCCGCGGTG CCCGACGACC TCGTTCTCGC GGTACCGCGC CACGACTCGT TGTTCACGAT GGTGGTGGGT GCGCCCGATC AGCCCCCGGT GGCTCGGGAG ACGCTGCTGG CCAGCGTGGC TGACCTGCCG GCGCCCGTGC GGGAGCGGCT CCTCGTCGTG GCCTACGGCC CGGAGCAGGA CGATGCCGCC CCCGCGCTGG CCGCCACGTT CGGCGTCCGC GTCTACGGTG CCGACGGACT GCCCGGGTAC GGCCCGGCCG GCGAGATCGT TGTCCGAGCC GTGGCCGCGG ATGGCCGGCG TGGGCCGCGC CAATGTGCTC GGACCTTCGC TCAGGCCCCG GAAGAGGCCG AGGGGCGACC AGCATTGCGC GACGAACCAA GCGCTTCGCC GGACGAATGG CCGGCGGGGA CCTCCGCCAC GGACATTGCG GCGGAGGCGC CGTTGCCGCC GGTCCGGGTC CGGCCGGACC AGCGCAGCGC CGCACCGGAA CGCCATCGGC TGCGTGGGGC GCTCGGGGCG TACTACGACC TCCACGCGCG GGTGGTGGCG CGACTGTTGG CTCAGCACCC GGGGCTGCGC GTGCTCCCCG CCGGGGATGA GCCCCATGCG TTGATGACGG ACCTGGTCGC GGTGCGGGCG TTCCTCATGG GTGACCGCTC GTCCGTGACG GCCGCACTGC GTTCGATGTC GGACGTGGGT GACCCGGCCT TCCTCATTTG TCTGGCATCC GGCCTCAGGC GGCTACCCAG CTATCACGGG GTGGTGTATT CCTCTGTGCC GACAGAGCAC GCATCGCGCG TCTATCTGGA CGGCCGATCC ATCTGGGAGC CGACATTTCT GGAGGCGTCC ACAACTCGGG TTGCCGCGGG TGCCGAGATG ACGGATCTCA TCGTGTGGTC CACCAACGGT CGGCACGTCG GCGGAATCGT TGGTGGGGGA GACCACCACC GCGTGGTGTT CCCCGCTCGG TCCCGCTTTG TCGTGCTCGG CCACCGGCCG GCGGGGAGAG ACTGCTCCGC TGCGGTGTTC CTCCGTGACG TCCCTGCTGA GCCCGGGCAG ACCGAGGGGA CGACAAACCG CCGTATCCAC AAGCGACTCG AGGCGTTGAC GGCGGCTGGC GCGCGCGTGG GACGTCAGGC CGCGTCCGAT CCTGGCTGGG CGGTCAGTGG TGAGCTGCCG GGTTGTGACG AGACAGGGCG ACCGTATCGG TCGGAGTAA
|
Protein sequence | MIEVSAHRVG QVLMVGAEAA VARAAAHLTQ PPAEPGRTTV QVVGDDLRDW EQLAPTLAAH TSEVGTSVRL VSPPAMNPPS IASARRLGDQ LGIEVVAPDG PLLPARDGSM FVLGPDASWW IIEAGAAARQ VGPHHPTPWW AEHRPTTATD WVTIPAGGWL PGGERPDAAV PDDLVLAVPR HDSLFTMVVG APDQPPVARE TLLASVADLP APVRERLLVV AYGPEQDDAA PALAATFGVR VYGADGLPGY GPAGEIVVRA VAADGRRGPR QCARTFAQAP EEAEGRPALR DEPSASPDEW PAGTSATDIA AEAPLPPVRV RPDQRSAAPE RHRLRGALGA YYDLHARVVA RLLAQHPGLR VLPAGDEPHA LMTDLVAVRA FLMGDRSSVT AALRSMSDVG DPAFLICLAS GLRRLPSYHG VVYSSVPTEH ASRVYLDGRS IWEPTFLEAS TTRVAAGAEM TDLIVWSTNG RHVGGIVGGG DHHRVVFPAR SRFVVLGHRP AGRDCSAAVF LRDVPAEPGQ TEGTTNRRIH KRLEALTAAG ARVGRQAASD PGWAVSGELP GCDETGRPYR SE
|
| |