Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3647 |
Symbol | |
ID | 5703341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4208848 |
End bp | 4210242 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641273072 |
Product | hypothetical protein |
Protein accession | YP_001538436 |
Protein GI | 159039183 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.279677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00645734 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAACCAGA CCCACAAACC CGACCGCCTC GGCGGGGACC GGGCGAGGTC ACGGCAGCGG TGGTGGGCTG TCGGGCTGGC CGGCGTGACC GGCCTGGCCC TGACCACCGT CGGCGTCGCC GCCAGCCCGG CCGCTGACGC CGCCGAGCGA ACCCTGACCG CTGTCGACGA CCGCGGCGAT CGCGACGACG ACCAGCGCGA CAACGGGCGT ACGCACGGCG ACAGCGGCAA GGGGAAAGCC AAGCCGAACG CGATGGGCAT GCCGGTCCCC TGCGACGCGG ACACACTGAT CGCCGCGATA ACCCTGGCCA ACGCCCGGGG TGGCGCCGTC CTCAACCTCG CCAAGGGCTG CACCTACCTG CTCACCGCCG ACCTCGACGG CGCCGGCCTG CCCGTCATCA CCACCCCCAT CACGCTCAAC GGCGGCACGA GCACCACCAT CAAACGCGAC TCCGCCGCCG AACCGTTCCG GATCCTCACC GTCGACGCCG GCGGCGACCT CACCCTCAAC CACGTGACCG TCACCGGCGG CCAGACCGAG GGAACCAACA ATGGTGGCGG AATCCTCGTG AACAGCGGAG GCGCCCTGGC GATCAACCAC AGCGCCATCA CGCACAACAT CGGCAACAAC GGCGGTGGCG TGGCCAATCT AGGTAGGGCC ACCGTCACGC ACTCCAGGGT GAGTGGGAAC ACTGCACGGG TCAGCGCGGG CGGCCTCCGC AACGCGGGCC GGCTCACCGT CGACCATTCC GCCATCACGG GCAACACCGC CAACGCCGGA GTCGGAGTGG GATTCGGCGG GGGGATCGGC AGTTTCGCCG GGGGAGTCAC GGTCATCAAC CGCAGCAGCA TCACCCGGAA CCACTCAGTT GTCGCCGGAG GAGGCATCGG CGACTTCAAC GCGACCACCA CTGTCACCGG CTCCACCATC GTCGGCAACA CCGCAGCCGC CACGGGTGGC GGTGTCTTCA CCGAAGGGCG TCTCACCCTC CGACACGTGA AGCTCGTCGG TAACCACGCC TCCAGCGATG GCGGCGGCGG CTTCAACATT CAGAACATTC TCGGCACCAG CGTCGCGACC ATCGAGGACA GCAAAATCGC CAACAACAGT ACGCGTGGCG TTGGTGGAGG GATTCGCAAC CTCGCCGCCA CGATCCTGCT ACGAAACGCG CGGGTCGCCG GCAACCAGGC GGACAATGGT GGCGGTGTCT TCAACAACAG CACCGCCACG CTCACCCTGC TCTCCACCAA AGTTGTTGAG AACACCGCCG TAACTGACGG TGGCGGCATC TTCAATATTG TCGGTGGCAC CGTAAACCTC AATACGGCCA CTGGCACCAC GGTGATCAGG AACCGGCCCA ACAACTGCGT CGACGTCCCC GGTTGCCCGG GCTGA
|
Protein sequence | MNQTHKPDRL GGDRARSRQR WWAVGLAGVT GLALTTVGVA ASPAADAAER TLTAVDDRGD RDDDQRDNGR THGDSGKGKA KPNAMGMPVP CDADTLIAAI TLANARGGAV LNLAKGCTYL LTADLDGAGL PVITTPITLN GGTSTTIKRD SAAEPFRILT VDAGGDLTLN HVTVTGGQTE GTNNGGGILV NSGGALAINH SAITHNIGNN GGGVANLGRA TVTHSRVSGN TARVSAGGLR NAGRLTVDHS AITGNTANAG VGVGFGGGIG SFAGGVTVIN RSSITRNHSV VAGGGIGDFN ATTTVTGSTI VGNTAAATGG GVFTEGRLTL RHVKLVGNHA SSDGGGGFNI QNILGTSVAT IEDSKIANNS TRGVGGGIRN LAATILLRNA RVAGNQADNG GGVFNNSTAT LTLLSTKVVE NTAVTDGGGI FNIVGGTVNL NTATGTTVIR NRPNNCVDVP GCPG
|
| |