Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2644 |
Symbol | |
ID | 5703589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 3011238 |
End bp | 3014168 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641272102 |
Product | hypothetical protein |
Protein accession | YP_001537472 |
Protein GI | 159038219 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00661972 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACCGTAC TATCCATGGC GGGACTGTCG ATGTCGATGC CAGTGGCCGG TGCCGAGCCG GCAGCGGCAG AACCAGTCCG GCCGGGCGCG AAGGCGGGCT TCACCGCCCC GGACAGGGCG CTGGGCGAGG ACTGGAAGAC CTCCAGCGAC ATCCTGGTCA CGGGGGCCGG TGACACCGAG GGCTATCACC TCTACGTCGC AAGAGAGTCG GCAGCGTTCG GGTGGGCCAC CCTTGCCACC CTGACATCCA GCGCGATCTC CGTCGGCCCC TGGACCGGCA ACGTCTGCGT GACCGGTTCA GGGCGCTACG CGATCGCCGT CTTCGCGCCG AAGAAGGCTG CCAACAAACC GGAACTGTCC CGTGCGGGAG CTCTGGCGGC CGTCGTGGAG ATCGCCACCG GCAAAGCGAC ACACGTCGCA ACCGGGGTCG AACTGGCGTA CTTCAATCCA GGGTGTGGTC CCGACGACCG AGCGTTGCTG ACCCGTTCGC TGGGTGACGA CTTCACGCTG TCAACCGAGT TACTGACAGT CGATGCAGCG AGTGCCAAGG TCATCGCGAC TCGCCGAGTC GAGGGCCAGC TCACGACGCC CGCTCCCGGG CCGAAAGGCG ACTACGGAAT TCTCGGCCGC CATCTGGTCC AGATTGACGA ACAGGGTCAG GCGATCCGAC AGGCCAGCCT GACCGGGCAG CCCTTCGGAC TTCAGGCCAC CGCGCGAAAT GGCGTCGATC TGGTCGCCGT ACACGGTGAC CGGGCGCTCG CCCAGCGATT CCACGATGGA AGACTGACCA CTGTCGCCAC CGGCCCTTGG GACAAGCTCC AGTTGTTCGG GCAACAGGGC GGTCACAACG CCCTGGTGGG TCAGGTGCAC GCACGTGCCA GCATGCCCGA GTTGAGGGTG GTGCAGACCG CGCGTCAGAT TCGAGCGCTG TCCGAAGAGG GACACCTTGC CGCGGCAGAG GTAGTCAGCC AGCAGAACAT GCGGGCGGCG GGTCAGCCGC TGCTTCCAGC GGACCCAGCA GACGCAGGTG ACGTGCGCGT TTCCGTACAG GCGACGGCTA CCGGCGAGAA GGCCACCCGC ACGTTCAATA CGACGGCCGC GCCCACACTC GATGTCGCGC TGAACACGAA TGCCCCCGCC GAAGCAGGGA CATCCCGGGT GGTCGATTCT GAGGCGTTGA CGCCAACCTG CGCGGTGCCG CGTAACGACC CCCGTGTGCA GCCTCTCCAA CCAAGTCCAG ATATGGTGGA GTGGGCCGTG AACCAGGCAG TGCACGGCCG GCTCAACGTC AACCGCCCGG CGAACTACCT GAAGGCCGGC CTACCGGCCT ACCAGCCACA ATCTTTGTTT CCAGCCCGTA CCCTGATAGG GGGCGGAAAG GTCCCAGCGC AGGTCATGCT TGCCATCCTG GCGCAGGAGA CGAACCTGTC CCAGGCATCG TGGCACGTGG TGCCTGGTGA CACGGGTAAC CCCCTGATCG CCAGCTACTA CGGCAACCAC GACAACCTCG ACGTGATTGA CTACAGCAAG ACCGACTGCG GGTACGGCAT CGGTCAGGTC ACCGACGGCA TGAGGGTAGG AAGTGCGCTG TTCACCGAAA CGCAGCGCAG GGCGATCGCG GTGGACTACG CGGCAAATAT CGCCGCCGGA ATGAACATCC TGATCGAAAA GTGGAATCAG ATGGCCGGCG AGTTGTCCGC GCACCAGAGC TACATGAACA ACAACGATCC GGCCTTCGTG GAGAACTGGT TCCTAGCGGC TTGGGCCTAC AACAGCGGAT ATTATCCGTA CACTACTCGC AACAGCGAGC TACAAAACGG CCGGTATGGT ATTGGCTGGT TCAACAATCC CGCCAATCCT CGATACCCCG CGAACCGCGC ACCATTCCTG CGGTTGACCC CAGCTGATGC GGAACGCCCC AACGAATGGG CTTACCCGGA GCGAATCATG GGCTGGGCGG AAACACCGCA GCTCAAAGGA TTTCCGGTAA TGACGCAGGC GTACGCGGAG CCGGATCATG GGGCGAACTC GCCGCGCACC GGACCTCAGG GGATCAATCA GGTTCTCTCA ATTCCGGACC GATACGAGTT CTGTTCGACG GTCAACAACT GCTCCGAAGC CACCAACGGA TGCCCGGCCG AGTCGGAGTT GTGCTGGTGG CACGGCGCGG CGAATTCGGG CAACTGTCCG ATGGACGAGT GCGCGAAGGA AAAGCTCACC TTCAGTGCAG GCGCCCCCGA GCCAGGAGTG AAGCGGATCT ATGAACGCAA CTGCGAGACG TTCACTGGCG AGAAAAACGG AAACCGGGAT CCAAGCCGAG ACGTCTCCGT GGTCTATACG TTGAACGACA CCGGACAGTA CAATCTCGGA TGCGACATTG GTGAGTCTGA CGGCAAGTTC ACGATTCGCC GGGGCCATCC GGCCGGCAGC GGCAGCAGCG CACCCTACGC CGAGATCGAT CTCCACCAGA TCGGTGCCGG CTACAAGGGA CACATCTGGT ACACGTACGT CAACCCGGGA AATCCCAAAC GCCGGATAGT TGGGTCCTGG ACCCCGAATC TCGATCTGGC TCCAGGAGAG AAGGCCCGCT ACGACATCGT CGCCCACGTA CCCAGCCACG GAGCCGACTA CGACGCGGTG GAATACCTCA TCACCCGAGG AGCCATCCTG GGGCAGGCGA CATGTGATAT CGATTTCGCC GAGGAGGCTG GCTGGTCGGT CTGGCCAGGC GTCCCTGACC CTAACCCTTT CAACCTGGGC GAGGATAAGT GGGTCTACCT GGGTTCCTAC GAGTTGGGCC GGGGTGCTCA GGTCCAGTTG AATAACATCG GTAACGAGAC TATCAATGGC TTCGACGCGG TCGCGTTCGA CGCAATGGCG TTCGTCCCGA TCGGGAACAA CCCGGGGCAC TCATGCGGTG ACGACTACTA G
|
Protein sequence | MTVLSMAGLS MSMPVAGAEP AAAEPVRPGA KAGFTAPDRA LGEDWKTSSD ILVTGAGDTE GYHLYVARES AAFGWATLAT LTSSAISVGP WTGNVCVTGS GRYAIAVFAP KKAANKPELS RAGALAAVVE IATGKATHVA TGVELAYFNP GCGPDDRALL TRSLGDDFTL STELLTVDAA SAKVIATRRV EGQLTTPAPG PKGDYGILGR HLVQIDEQGQ AIRQASLTGQ PFGLQATARN GVDLVAVHGD RALAQRFHDG RLTTVATGPW DKLQLFGQQG GHNALVGQVH ARASMPELRV VQTARQIRAL SEEGHLAAAE VVSQQNMRAA GQPLLPADPA DAGDVRVSVQ ATATGEKATR TFNTTAAPTL DVALNTNAPA EAGTSRVVDS EALTPTCAVP RNDPRVQPLQ PSPDMVEWAV NQAVHGRLNV NRPANYLKAG LPAYQPQSLF PARTLIGGGK VPAQVMLAIL AQETNLSQAS WHVVPGDTGN PLIASYYGNH DNLDVIDYSK TDCGYGIGQV TDGMRVGSAL FTETQRRAIA VDYAANIAAG MNILIEKWNQ MAGELSAHQS YMNNNDPAFV ENWFLAAWAY NSGYYPYTTR NSELQNGRYG IGWFNNPANP RYPANRAPFL RLTPADAERP NEWAYPERIM GWAETPQLKG FPVMTQAYAE PDHGANSPRT GPQGINQVLS IPDRYEFCST VNNCSEATNG CPAESELCWW HGAANSGNCP MDECAKEKLT FSAGAPEPGV KRIYERNCET FTGEKNGNRD PSRDVSVVYT LNDTGQYNLG CDIGESDGKF TIRRGHPAGS GSSAPYAEID LHQIGAGYKG HIWYTYVNPG NPKRRIVGSW TPNLDLAPGE KARYDIVAHV PSHGADYDAV EYLITRGAIL GQATCDIDFA EEAGWSVWPG VPDPNPFNLG EDKWVYLGSY ELGRGAQVQL NNIGNETING FDAVAFDAMA FVPIGNNPGH SCGDDY
|
| |