Gene Information |
Locus tag | Jann_2003 |
Symbol | |
ID | 3934456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Jannaschia sp. CCS1 |
Kingdom | Bacteria |
Replicon accession | NC_007802 |
Strand | - |
Start bp | 2005022 |
End bp | 2007007 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637904359 |
Product | glycosyl hydrolase, BNR protein |
Protein accession | YP_509945 |
Protein GI | 89054494 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
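The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon (the gene sequence in this record ends in TAA, which is consistent with that). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows for Jann_2003.
length_bp = gene_length(2005022, 2007007)
print(length_bp)                  # 1986, matches the Gene Length row
print(protein_length(length_bp))  # 661, matches the Protein Length row
```

Under these assumptions both derived values agree with the reported Gene Length (1986 bp) and Protein Length (661 aa). The strand field does not affect the calculation, since the length depends only on the coordinate span.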
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.075614 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.402986 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAGC CCGCCGATAT ACATCCCAAG CCAGAACCCG CCACCTCGCC CGCGCCCCCG GTGTCGCCGC CCGCCGATAT TCCGTTCCCC TTCGCAACAC TCGTCGAGAA CCAGCGCAGC CGAGCTCACA ATGCCACGCG CCGCGCCCAG GGCATCTTTG CCGCACTAAT CGCCGTTATC CTACTAGGTA TCGGGTACTA CGTAGGCCTG CCGTTGTACC TCCAATGGAA CGAGTCGTCA CGCAGCGTCT ACGAAGCTGA GGCGGGTACG CTGTCCTCGC AGTTGGACCG GATGGATGAG GCAAGGGACC GCATATGGAC AGCTCTGTCC GAAGAGCTGA AGGGAATCGG AACTCCTCGC CCCTCTGGCG TCGACGTAAA CCTGAATGAC GTCCGTAGTC TGCCCGACGG TACTATCTTG GTCGCCGTGG GGCGGCACGG CACCGTCATA CGGTCCACCG ATGCCGGCAA CACTTGGACA CCTCGTCGCT CCGGCGTGGA TATTGATCTA TTCAACCTCC GTGTTCTGCC CGACAGCACC ACCCTGGTCG CTGTGGGAGA GGGCGGCACC GTCATACGGT CCACTGATGC TGGCAACACT TGGCAGTCCC GCCCTTCCGG CGTCAATAAC AGTCTGTACG ACTTACGCGT GCTACCCAAC AGCACCACCC TAATCGCTGT TGGACGAAGC GGTGCAGTCA TACGGTCCAC TGATGCTGGC AACACTTGGA TGCCCCGCCC CTCCGGCGTG GATGTTGATC TATACAACCT TGGCATTCTG GCGGACGGCG CCACCTTGGT CGCTGTAGGC GAAGACGGCA CTGTCATACG CTCGACCGAT GCCGGCGAAA CTTGGACATT CCTTCCCTCC GTCGTGGATG CCGATCTGTT CAACCTCGGC ATTCTGCCCG ACAGCACCAC CTTAGTCGGG GTAGGAGAGA GCGGCACCGT CATAAGGTCT ACCGATACCG ATAGCACCTG GATGCCCCGC CCCTCCGGCG TCATTGCGGA TTTGTATGTC CCTCGCGTGC TGCCCGACGG CGTTACTCTG ATCGCGGTGG GATCAAGTGG CGGAATCATA CGCTCGACCG ATGCCGGCAT GACATGGACG CCCCGCCCCT CCGGCATCGA TGGGAATCTG TTCAACTTCA GCGTTTTGCC CGACAGCGGC ATCCTGGTGG CGGTAGGGTC AGATGGCGCA GTCATACGCT CGACCGATGC CGGCGTGACA TGGACGCCCC GCCCTTCCGG GCAAGATATT ACCCTAAGAG ATATCCGCGT TCTACCCGAT GGTACGACAC TGATTGCGAT AGGTTTTTTT GGCGCGATTG TGATTATCGA CGACCGTTAC GCCGATGCCC TCGCCGCGAT CGGACCTCTC TCCGGCTCCC TCGGCGATAA TGCCTACCGC AGCGGCATTG CCGGGCTCCC GGAGTATGTT CGCAACCACC CCGTCGTTGG GGCCTTACTC GCCGAACTGG ACGGCACCAT CGAAGGTCGT GCCGACCTCG AAACCCGCCT TCAATCCGCC CGCGCCTCCG CCGACGAAAT CAGGACAGGC GGCTTTTCTC TCGCTCAACG GCGCCAGGAT TTCGAGGAGT TCATGCGCGT ATGTACAGCC GACCTGTCAG ACGAGGCAGA GGGCGTGGGC ACCGAACACT GCACCCGCGC CTATGTCGAC CTTCGCCAAG CCGAAAGCCA GACGGTGTGG GAGATCCTTG CCGAACGGGC GCCGCAGGCC ATCTTGCTGT TGTTCCTCCT CGCAACGCTT GCGGCACTTT ATCGGTACAA CATGCGCCTT 
GCGGGTTTCC ACGCCGCGCG AGCCGACGCG CTGCATCTCT ACGCCATGGG CCGCACCCAC GACCCCGCCA TTCTGACGGA GTTCTCGGAC GCACTAGCCG CCGATAAGGT CGAGTTCGGC AAGGGGAACA CACCGTCGGA GCAGGCTGTC GAGATCGCAA AGGCTATGGT CGGGCGGCGT GGGTAA
|
Protein sequence | MNQPADIHPK PEPATSPAPP VSPPADIPFP FATLVENQRS RAHNATRRAQ GIFAALIAVI LLGIGYYVGL PLYLQWNESS RSVYEAEAGT LSSQLDRMDE ARDRIWTALS EELKGIGTPR PSGVDVNLND VRSLPDGTIL VAVGRHGTVI RSTDAGNTWT PRRSGVDIDL FNLRVLPDST TLVAVGEGGT VIRSTDAGNT WQSRPSGVNN SLYDLRVLPN STTLIAVGRS GAVIRSTDAG NTWMPRPSGV DVDLYNLGIL ADGATLVAVG EDGTVIRSTD AGETWTFLPS VVDADLFNLG ILPDSTTLVG VGESGTVIRS TDTDSTWMPR PSGVIADLYV PRVLPDGVTL IAVGSSGGII RSTDAGMTWT PRPSGIDGNL FNFSVLPDSG ILVAVGSDGA VIRSTDAGVT WTPRPSGQDI TLRDIRVLPD GTTLIAIGFF GAIVIIDDRY ADALAAIGPL SGSLGDNAYR SGIAGLPEYV RNHPVVGALL AELDGTIEGR ADLETRLQSA RASADEIRTG GFSLAQRRQD FEEFMRVCTA DLSDEAEGVG TEHCTRAYVD LRQAESQTVW EILAERAPQA ILLLFLLATL AALYRYNMRL AGFHAARADA LHLYAMGRTH DPAILTEFSD ALAADKVEFG KGNTPSEQAV EIAKAMVGRR G
|