Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4273 |
Symbol | |
ID | 5705778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4848661 |
End bp | 4850334 |
Gene Length | 1674 bp |
Protein Length | 557 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641273692 |
Product | hypothetical protein |
Protein accession | YP_001539045 |
Protein GI | 159039792 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.4813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.012462 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCTTTC GGACCTGGGG CAAACTGCTA CTCACGGCGC TCGGAGCGAG CCTGCTGGCC GGAGCCATTC AGCTCGGCAT CGCATTTGGC TTCGGCATCG TTCGACTCAC CGGGGCCTTC ACCGGCGACA GTGTCAACCA GTGGCCGGCG CAGCTCACCT GGGTCGGTTG GATAGCGGCG AATGCCGCCG TCGCCGGTGC CATCGGCGCC GAACGGTTGG CTCGCCGGGA CGGCCTGCTG ACCGGTATCG GTAGGCAGGC GGCCGTAGCG GGTGCCGCGA CGCTCGGTGC CATCGTCGTC GCGCCGCTCT GCATGCAGCC CGCGCGGGCC GCGGACCTGG TCACTGCGGA ACCGGTCTGG GCCGTCGCCA CCTGCGCCGT CCTCGGCGCG GTGGCCGGCG GTGGTGCGGC GCTGGCCGTC CTGGCACGGC CCCCGCTGGG CTGGAGCGTG GCGCTCGTCG CCGGCGTGGT GTGGCTTCTC GCTCTGATCT CGGTTGCCCC GTCGCTGGGG GAGACCGGGC CACTGCGGAC CGTACGCCTC GGCATGTGGG AACCGTCCTG GCTGAGCGCC ACCGCGGCGC ACCGCCTGTC TCTGCTGTGT CTGCCGGTGG TGGCACTGCT GGTTGGAGCG TCCACGGGCG GCCTGGCCCG TCGGCGCGAG CTCCCGCCAC TGGTCGGTGG CCTGACCGGG GTAGCCGGAC CGGTGCTGCT GGCCTTCACC TACCTGGCCG CGGGTTCCGG CGACGAGGTG GATCGGTACC AGGCCGCGCC CTACTACGGC GCTCTGCTCG CGGTGGCCAC CGGTGCGCTT GGCTCGGCCG CCACCGTCGT GCTGCGCTGG CCACTCGTGG TGCACCCGGC TGACCGGTCC GTTGCGACAC CCGGCACAGA GGCCGCCGGC GCCATGGTGG ACGACACCTC ACCGAGCACC AACCCGGCAT CGAACCCCGA GACCACGCTG AGCCCTGAGC CACCACCGAA CACCGAACCA CCACCGAACA CCGAACCAAC CCTGAGTCCC GGGCCCGCTC CGACCCCCAA GCCGGCACCG AGCCCTCGCC GCGCGGGGCG ATCCGAGGTC ACCTCCCGAC CCACTCCAGT CACCCCAACA CCGGTCACAT CGCCCCGGCC CGTCCCGACC CCCGTCGAGT CGACCACACG TACCGCCGTT GACCCCGTAC CCGCAACACC ACCCCGGCCC GTCCCGACCC CCGTCGAGTC GACCACACGT ACCGCCGTTC AGCCCGTACC GGTAACCCCG CCCCGGCCGA CACCGACCCG GATCGGACCC GAGCCGTTCC CGATGCTGTC GCCGACACCG CCGACACCGC CCGTGACCCG CACTTTCCCG GCAGACCCGC CCACCTCCAC CGCCGCACCT GTCGTGTCCG GGGCAGCCCC GCCAACGGAC GACGGGTCCG ACCCGGACGC CCAAGGTGAT CGGTCGGCGC CCGACGAAAC TCCGCCTGTA GCCCGCCGCC GGGGCCTGTT CCGGCGCCAC CGCTCCCGCC CCAACGACGC GGGCGACCCG ACCGGGCCGG TACAGCTACC AGCGCAGGAC GAGGAGTTTG TCGACTGGGT CACCGGCCTG AGCAAGCCTG CTCCGGACAA CGAAGCCGAC CCGGAACGCG TCCGACGCTC GTTGCGCTCC GTCGGCCGAC ACCACGCCGA CTGA
|
Protein sequence | MAFRTWGKLL LTALGASLLA GAIQLGIAFG FGIVRLTGAF TGDSVNQWPA QLTWVGWIAA NAAVAGAIGA ERLARRDGLL TGIGRQAAVA GAATLGAIVV APLCMQPARA ADLVTAEPVW AVATCAVLGA VAGGGAALAV LARPPLGWSV ALVAGVVWLL ALISVAPSLG ETGPLRTVRL GMWEPSWLSA TAAHRLSLLC LPVVALLVGA STGGLARRRE LPPLVGGLTG VAGPVLLAFT YLAAGSGDEV DRYQAAPYYG ALLAVATGAL GSAATVVLRW PLVVHPADRS VATPGTEAAG AMVDDTSPST NPASNPETTL SPEPPPNTEP PPNTEPTLSP GPAPTPKPAP SPRRAGRSEV TSRPTPVTPT PVTSPRPVPT PVESTTRTAV DPVPATPPRP VPTPVESTTR TAVQPVPVTP PRPTPTRIGP EPFPMLSPTP PTPPVTRTFP ADPPTSTAAP VVSGAAPPTD DGSDPDAQGD RSAPDETPPV ARRRGLFRRH RSRPNDAGDP TGPVQLPAQD EEFVDWVTGL SKPAPDNEAD PERVRRSLRS VGRHHAD
|
| |