Gene Information |
Locus tag | Sare_4537 |
Symbol | |
ID | 5705978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 5129715 |
End bp | 5131535 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273951 |
Product | hypothetical protein |
Protein accession | YP_001539300 |
Protein GI | 159040047 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
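The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 5129715
end_bp = 5131535
reported_gene_length = 1821      # bp
reported_protein_length = 606    # aa

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, codon_count, protein_length)  # 1821 607 606
print(gene_length == reported_gene_length)        # True
print(protein_length == reported_protein_length)  # True
```

Under these assumptions the span, the gene length, and the protein length agree: 1821 bp is 607 codons, of which 606 encode amino acids.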
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.483456 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0103293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACAG CCGATTCGAG CCGCGCGGCC GACGCTCCCG AGGGCGGGGC GTCGACCACC GAGCCTGACC AGCGAAACCA GGCCACCACA CGGTGGCGGG TCGATCTGCT GGCGGTGCTG AGCTTCCTCG CGCTCGCACT CTGGGTGACC CTCCGGCTCT GGCTGGACCC ACGCGACGGG CTCCGGGACA ACCGCACCGA TCAGGCGCAG TTCGAGTGGA TGATGGCGCA CGGTTCACGA GTGGTGACCG ATTTCGCCTA TCCCTTCGCC TCGGATCGAA TGAACGTGCC CGAGGTCGTC AATTTGATGG CCAATACGTC CGTATTATCT GTATCTATAC CAATGACGCC GGTCACCCTT GTGGCCGGAC CCCGGATGTC CTTCCTGCTC TTTCTCACCC TGGGGATGGC CGCCACCGCA ACATCGTGGT ATTTCCTGCT GTCCCGGGTG GTGGTCCGGT CTCCCGGCCC GGCCTGGCTC GGCGCCACAT TCTGCGGGTT CGCGCCCGCC ATGGTCTCGC ACGCCAACGC CCACCCCAAC ATCGTCTCCC AGTTCGTGGT GCCACTGATC ATCTGGCGTA CCCTGCGCCT CGGTGAGCCG GGCCGCTGGC TACGCAACGG GCTGCTGCTC GCCCTGGTGA TCGTCTGGCA GGCATTCCTC AACCTGGAGA TCCTGCTGAT GACCGCGATC GGCCTCGGTG TGGTCATCGT CGCGCTTGCC CTCGGTCGAC CCGACCTACG CCAGCGGACA CGCCCGTTCC TCGCCGGGCT GGGCGTCGCC GCGGGAGTCA CGCTCGTCCT GCTGGCGTAC CCGCTGTACG TACAGTTCTT CGGTCCCGGC GCCTACCGGG GGCTGTCACC CCTCATCCGC GGCTACTCCA CCGACCTCGC CTCGTTCGTG GCGTACTCCC GGGAGTCGCT GGCCGGCGAC GAACCCGGTG CGAGAGGGCT GGCGAAGAAC CCCACCGAGG AGAACGCCTT CTTCGGCTGG CCCCTGTTGG TGCTCGTCGC CGCACTCGTC TGGTGGCTGC GCCGCAACGT CGTCGTCCGG GCCCTGGCCC TGCTCGCGGT GGTCTTCGCC GTGCTCTCGC TCGGCCGGGA AGTCCTGTTC AACGGCGAGG CCACCGGTAT ACCGGCTCCC TGGGCGATAC TGGAAACCCT GCCGATCCTG CACTCGGTGG TACCGACCCG CTGGGCCCTG GCCATCACCC CGGTGATCGG GCTGCTGCTC GCGTACGGGG CACAGCACGC CCGCACCCTC GCCACCCGGA ATCCGTCCGC CCGGCCACAG ATCCGCTTTG CCACGGTCAC CGTACTGGCG ATGGCGCTCC TGCCGCTCCT GCCGACCCCA CTGCCGGCGG TCCGGCTGGA GCCCACGCCC GCCTTCGTCA CCTCTGGCGC ATGGCGCCCC TACGTGGCCG GTGGTCGCAG CATCGTCACC CTGCCGCTGC CGGACACCCA CTACGCCGAC CCGCTGCGCT GGTCGGCCGA GACAGGTCTG GAGATGCCGA TCGCCCGGGG GTACTTCCTC GGCCCGGACA CCCGCCCCGA CCGGCACCGC GTCGCCCTGT TCACCGCCCC AGACCGCCCG ACCAGCGACT TCTTCACCGA AATTCGGCGT ACCGGTGAGG TGCCACCAGT CAGCCAGCAG GAACGAACGG CCGCTGAGGA CGACCTGCGG TACTGGCGGG CCGGCGCGGT CGTGCTCGGT CCACACCGGC ACGCGGACGC GCTACGCCGC GGCATGACCG AGCTGATCGA GGTCCAGCCG ACCTACACCG GGGGCGTCTG GCTCTGGGAC 
GTGCGACACC TCACCGACTG A
|
Protein sequence | MTTADSSRAA DAPEGGASTT EPDQRNQATT RWRVDLLAVL SFLALALWVT LRLWLDPRDG LRDNRTDQAQ FEWMMAHGSR VVTDFAYPFA SDRMNVPEVV NLMANTSVLS VSIPMTPVTL VAGPRMSFLL FLTLGMAATA TSWYFLLSRV VVRSPGPAWL GATFCGFAPA MVSHANAHPN IVSQFVVPLI IWRTLRLGEP GRWLRNGLLL ALVIVWQAFL NLEILLMTAI GLGVVIVALA LGRPDLRQRT RPFLAGLGVA AGVTLVLLAY PLYVQFFGPG AYRGLSPLIR GYSTDLASFV AYSRESLAGD EPGARGLAKN PTEENAFFGW PLLVLVAALV WWLRRNVVVR ALALLAVVFA VLSLGREVLF NGEATGIPAP WAILETLPIL HSVVPTRWAL AITPVIGLLL AYGAQHARTL ATRNPSARPQ IRFATVTVLA MALLPLLPTP LPAVRLEPTP AFVTSGAWRP YVAGGRSIVT LPLPDTHYAD PLRWSAETGL EMPIARGYFL GPDTRPDRHR VALFTAPDRP TSDFFTEIRR TGEVPPVSQQ ERTAAEDDLR YWRAGAVVLG PHRHADALRR GMTELIEVQP TYTGGVWLWD VRHLTD
|
| |
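As a spot check that the protein sequence corresponds to the gene sequence, the first and last codons can be translated with a small script. The sketch below assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in which start codons are permitted). `translate` is a hypothetical helper, and only the first 21 and last 21 bases of the gene sequence are used.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: standard amino-acid assignments, as used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 21 bases of the gene sequence in the record.
head = translate("ATGACTACAG CCGATTCGAG C")
tail = translate("GTGCGACACC TCACCGACTG A")

print(head)  # MTTADSS
print(tail)  # VRHLTD
```

These match the start (`MTTADSS`) and end (`VRHLTD`) of the listed protein sequence, and the final codon `TGA` is a stop codon, consistent with a 606 aa product from an 1821 bp gene.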