Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Jann_1082 |
Symbol | |
ID | 3933526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 1045421 |
End bp | 1047265 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637903430 |
Product | pentapeptide protein |
Protein accession | YP_509024 |
Protein GI | 89053573 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0577925 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAAGC GCCTCAACCT CAAGCTTTCG CGGCACCGCG CCCAGACGGA TGTACCCGCC TCCGCCCCCG CCCCCGCGCC GACCCAAACC GCCGATGCCG CGAAGCAGGT CGCGCGCATC AATGACCTGA TCAGCCTGGC CCGCAAATCC TGGTTCGGCC TCCTGTCCTA TCTGGGATTT GTCGGCGTGA CGCTTCTTGG CGTCGAGGAT GCGGATTTCT TCATTGCGGA TCGTCAGACC GACCTGCCCC TTGTGGGCGT CTCGATCCCC ACGGTGCTGT TTTTCTATAT CGCCCCCACG ATTGGGGCCG CGCTTTACGC CTATTTCCAC CTGCATCTGC TGAAACTCTA TGAGGCGCTG GGTCGGGCCG ACAGAACCCA TGTGCCCCTG GCGGACCGGA CCGCGCCCTG GATGGTCGCG GACTTCGCCC TGGCCTGGCG ACCGGGTGAG ATGTCGAAAC GTCCAATGCG CTGGCTGGCC TTCGCGACCA CGCTCCTGCT CGCCTATATC GCGGGGCCGC TGGTCCTCTT CGCGTTCTGG TTCCGCTCCA TGCCCTATCA TGAGGAGCTT CTGACGGTCA TTTTCTGTGG CATCCCCCTG ATGGTCGTCC TTTATGCCGG GCGGCGCAGC TGGTCGTACA TGATCGCCCA GTTTCGCCCC GATCAGCGCC GCCGCCCCAG CGTCCAGTTC TGCCTTGCGC TGCTTGGGGG CGTGGCGCGC GTTCTCGTCG TGGCGCTTGG ATGGCTGGCC ACCGAAGGCT CCTTTGAGCA TTACGTGCGC CAGTTCGACT GGTTCGACGA CACCGGCGAG ATGACCCATC TGGGCAGCAA TCCGGCCCAT GCCCAATGGT GGGGCGATCC GCGCTGGGAT TTCATGTTGT ACCCCGCCAA TCTTGCCGGC GTTAACTTCG TCGATGCGCC GGAAGACTGG CGCGAACACG ACGCTGCCCG CGCGGAGTTT CGCGATGCAT GGTGCGGAAC CGAGGGGATC CCCGAGCAAG CCTGCGGCTC CGCCGATCCC TTTGCCCAAC TCTCCGATGA AGCCGCCGAG TTCCTGCAAG CCGCCCGCCT CGGCTGGTGT GCTGACATTA TCGGCGGGGC CGTGGCAGAC CTCCAGCCCC TCTGCGACAC CCGGTTCCTG GAGTTTGAAA GCACCTTCGC CGCGGATTGG ACGACCGAAT GGGCCATCCT GCTCAACAGC CTCACCGCGC GCAACCTGTC CGGGTTCGAT CTGCGCGGCG CGGATTTGAC CGACGCGCAG TTGCAGGGCG CAAACCTCAG CACGGCGCGA TTGCAGGGGG CGTCCCTCCG GGCGGCGGAG TTGCAGGGGG CAAACCTCAG CCAGGCCAAG ATGCAGGGGG TCGTCCTCTT CGGGGCATTG ATGCAGAGGG CGGTTCTTGG GCAGGTGGAG ATCCAACAGG CCGATCTCAG CTTTGCGCAG ATGCAGGGCG CTTTCTTCAG CATCGCGCAA ATGCAGGGGG CGGACCTTTT CGGGGCGCAG GTGCAGGACG CCGTGTTCAG CCGGGCGCAG ATGCAGGGCA CGGTTCTGTT CGGGGCGCAG ATGCAGGGGG CCGACCTCTT TGAGGTGCAA TTGCAGGGGG CTGACCTCAG AAGCGTCGCC ATGTCGGACA CCACCGCCCT GTTCAATTCA TCCATCCGGG GGGCTGGCAT CAATTCCGTG GATGATGCGA CGCTGGCGCA GCTCAGCGCC CATCGCGACT GGGACGATGC GTATTATCAG GACGAGGTCC TCTTCTTCGT CGCGTTCACC GACGACTGGC GCGCCTTCGC CACCAATCTC GACCCGCCCG TCACCATCGC CCCCGACTTT GAGCACCGGG ACTGA
|
Protein sequence | MSKRLNLKLS RHRAQTDVPA SAPAPAPTQT ADAAKQVARI NDLISLARKS WFGLLSYLGF VGVTLLGVED ADFFIADRQT DLPLVGVSIP TVLFFYIAPT IGAALYAYFH LHLLKLYEAL GRADRTHVPL ADRTAPWMVA DFALAWRPGE MSKRPMRWLA FATTLLLAYI AGPLVLFAFW FRSMPYHEEL LTVIFCGIPL MVVLYAGRRS WSYMIAQFRP DQRRRPSVQF CLALLGGVAR VLVVALGWLA TEGSFEHYVR QFDWFDDTGE MTHLGSNPAH AQWWGDPRWD FMLYPANLAG VNFVDAPEDW REHDAARAEF RDAWCGTEGI PEQACGSADP FAQLSDEAAE FLQAARLGWC ADIIGGAVAD LQPLCDTRFL EFESTFAADW TTEWAILLNS LTARNLSGFD LRGADLTDAQ LQGANLSTAR LQGASLRAAE LQGANLSQAK MQGVVLFGAL MQRAVLGQVE IQQADLSFAQ MQGAFFSIAQ MQGADLFGAQ VQDAVFSRAQ MQGTVLFGAQ MQGADLFEVQ LQGADLRSVA MSDTTALFNS SIRGAGINSV DDATLAQLSA HRDWDDAYYQ DEVLFFVAFT DDWRAFATNL DPPVTIAPDF EHRD
|
| |