Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4871 |
Symbol | |
ID | 5707563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5522739 |
End bp | 5524403 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641274267 |
Product | hypothetical protein |
Protein accession | YP_001539612 |
Protein GI | 159040359 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.213174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.122523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCTGC GGGATCTGCT GGAATTCCAC GCCGACCCGT GGCATACGTC CGCGCAGGTG TGGCATCGGC TCGCTCAGGG CATCGACGAC ACGGCGGAGC AGGTGATCCG CAGTACGCGC GATGTCGGTG ACGCCTGGCC GGGCGGTGCC GGCTCGGCGG CAGCGGTGCG GAGGGCCACC GCGTTACGGG CTGAGGTGAG TAACGCGTAC AACCCGGCGA AACGCATCGC GGACGCGATG GAGCACCACG CGTACGCGAT GTCGGGGCTG CGGCGGCAGG CGGAGGAGAT CGTCGCCTCG GCGCGGCAGG CTGGCTACCA CGTCGACCTG GTGACCGGGG TGACCACCGC GCCACCCTCG GCGGACATGG CTGGCGGCCT GGATCGGTCC AGCCGATCGA CCGAATCGGT GCTCCAGTAT CTGCCGACGG TGGTGGACCA TGCCCGCGCG CAGGACGACG CGACAGCCAA CAGGATCTCC GTCAATGTGC CGTCGCCGGG GGCTGGCTTC GGGACCGGCC AACTCGACGG CGTTTCGCGC GCGGTGCTCG AGGCGCAGTC GGAACGGAGC CCGGCCGAGA TCCACGCGTG GTGGGAGTCG CTGACTCCGC TACAGCAAGA GCAGGTGTTG CGGGAGTTCC CGGAACTGGT CGGCCGGATG GACGGCATTC CGGTATCCGA CCGGGATGTG GCCAACCGCA GCGTCCTAGA GCGCGAACGC AGTCTGTTCC AGCAGCAGCT GAGCACGATC GAGGCCAGAG AAGATTTCCT GTGGATAATC CTCCAGCAGG GTCGTTTTTC AGAGGTCTAC CCGGACGCCG AGGACCCGAG GACTGCATTG GAAAACGAGT TGAGGAAGCT CGCGTCCGAG CGGGTTGAGC TGCCCGGCAA GCTGCGTGGC ATCGACGCGA TCACCGCTCG GCTGAATGAC GTCAGCCTCC CTGAGGCATA CCTGATCGGT TACTCCAGCG ACGGTGACGG CCGAGCGATC GTCTCGGTCG GTGACCCGGA CACCGCCGAC AACGTGCTCA CCTACGTACC CGGCACCGGC GAGCACCTGT CCAAGGTCGG TGCCGGCCTC GAGCGTGCCG ACATCATGGC CAGGGACGCA CTCAAAGCGG CCCCGGACGA GAACACCTCG GTGGTCTACT GGTATGGCTA CGACGCCCCG AACACGATTT TTCCTGATGC CGGCTTGGAC TCCTATGCCG AGGGCGGCGG CCCGCTCCTC GACACCTTCC AAACCGGACT TCGGGCTACT CACGACGGCG GCATCCCGTC GCACAACACC GTGCTCGGCC ACAGCTACGG CTCTACCGTG ATTGGCCATG CCGCTAAGGA AAGTACCTTT AACGCTGACG CTCTGGTGTT CGTCGGCTCT CCCGGCGTTG ATGTAAACCA CGCCTCTGAG TTAAATGGTG TGCGACCCGG TCAGGTCTGG GTTACTACAG CGGAGAATGA CATCATACGT CGGGTACCCG ACTGGGATTT TATTCATGGT AACGACCCCA GTGACCGCGA TTTTGGAGCA CGAGTCTTCG CCAGCGACCC CGGCAACCCT GACGACGAAG CAGGCACCCA CTCCGCCTAC TGGGACCAGG ACAACATCGC GCGAAAGAAC ATAGCGCGGA TCGTCACGGA CAGTCCCGTC CGCCTGCCTG AATAG
|
Protein sequence | MTLRDLLEFH ADPWHTSAQV WHRLAQGIDD TAEQVIRSTR DVGDAWPGGA GSAAAVRRAT ALRAEVSNAY NPAKRIADAM EHHAYAMSGL RRQAEEIVAS ARQAGYHVDL VTGVTTAPPS ADMAGGLDRS SRSTESVLQY LPTVVDHARA QDDATANRIS VNVPSPGAGF GTGQLDGVSR AVLEAQSERS PAEIHAWWES LTPLQQEQVL REFPELVGRM DGIPVSDRDV ANRSVLERER SLFQQQLSTI EAREDFLWII LQQGRFSEVY PDAEDPRTAL ENELRKLASE RVELPGKLRG IDAITARLND VSLPEAYLIG YSSDGDGRAI VSVGDPDTAD NVLTYVPGTG EHLSKVGAGL ERADIMARDA LKAAPDENTS VVYWYGYDAP NTIFPDAGLD SYAEGGGPLL DTFQTGLRAT HDGGIPSHNT VLGHSYGSTV IGHAAKESTF NADALVFVGS PGVDVNHASE LNGVRPGQVW VTTAENDIIR RVPDWDFIHG NDPSDRDFGA RVFASDPGNP DDEAGTHSAY WDQDNIARKN IARIVTDSPV RLPE
|
| |