Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HY04AAS1_0910 |
Symbol | |
ID | 6743722 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 857729 |
End bp | 858784 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 642750716 |
Product | loricrin |
Protein accession | YP_002121575 |
Protein GI | 195953285 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0000184739 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAA TCAATATTGG AGAGCTTATA GTGGGCCTTA GCATAATAAC ATTTACAGGC ATAGCAACTG TTTATCATTT TTCTAATGGA GCAGTTTTTT CAAATACAGG TACAACTCCA ACTCCTGTAA TATCAAACAC TCCTCCTTCT AGTGCTACAC CAGTAACTAT ATTTGGTGGT GCATTAAATG GTGGTGGTGT AGGTGTGGGC GGTGGAGGTG GAAGCGTTAG CTGTGTTCAT AGTAGCACTA CTTATCAATA CGATGGACAA TATGGTTGGG ACGGCTCGCC ACAATATTTT TATATTACCT CAAGCATCAC CGCATGTGGT AGTATAAGCT TCACACTTTA TGGGGGTGGA GGTGGTGGAT CAGCATATGG CAATGGAGCA GGTGGTTCGG GAGGAATTAC CACTGGTATA ATTTCAACCA GTACTCTTTC AAGTAGTACA TTAACCATAA TAGTAGCTGG CGGAGGACAA TACGATAGTG GTGGTGGTGG ATCGTTTGTT TGTTTAGGTA CACAATGTAG CTTATCAACA GCAATATTAG CAGCTGGTGG TGGTGGAGGG GCAAACCAAA ATTACAACGG TGGAAACGGT GGAGGTGGCA ACCAAAATGG CCAAAACGGA AACGTTGGGT GTGGTTCTGT ACCGCAAGGA GGAACCTTGA GAGCCGGTGG TAGAGGCGGT GCTTGTCAAA CACGAGTGCA ACCTTATAAC AACGGTATAG CCGGAGCTGG TGGGGCTGGT GGAAACTACG GTGGTGGGCA AGGTACTCCT TTTGGTACTG GCGGTATAGA TCCCAATGGC AACAACGATA ACGGTGGTTA TGGAGGTGGG GGTTCTGGAG CTGGTGGATA CGATGGAGCT GGTGGTGGGA GTGGATATTA TGGAGGTGGG GGTGATAATA ATTCACCAAA TTGGGGCTCT GGTGCTGGAG GTTCTGGATA CTGTGCATCT ATAGTCCAAA GCTGCGGGGG TTCTCCTGGA GGGGCCCACA ACGGTGGCAC AAACGGCAAT CCAGGAAGTA ATGGACAAGT CATAATCTCT TGGTAA
|
Protein sequence | MKKINIGELI VGLSIITFTG IATVYHFSNG AVFSNTGTTP TPVISNTPPS SATPVTIFGG ALNGGGVGVG GGGGSVSCVH SSTTYQYDGQ YGWDGSPQYF YITSSITACG SISFTLYGGG GGGSAYGNGA GGSGGITTGI ISTSTLSSST LTIIVAGGGQ YDSGGGGSFV CLGTQCSLST AILAAGGGGG ANQNYNGGNG GGGNQNGQNG NVGCGSVPQG GTLRAGGRGG ACQTRVQPYN NGIAGAGGAG GNYGGGQGTP FGTGGIDPNG NNDNGGYGGG GSGAGGYDGA GGGSGYYGGG GDNNSPNWGS GAGGSGYCAS IVQSCGGSPG GAHNGGTNGN PGSNGQVIIS W
|
| |