Gene Information |
Locus tag | Haur_3698 |
Symbol | |
ID | 5735547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4651088 |
End bp | 4652641 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280850 |
Product | hypothetical protein |
Protein accession | YP_001546462 |
Protein GI | 159900215 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
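The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4651088
END_BP = 4652641


def gene_length(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1554 517
```

Both results agree with the Gene Length (1554 bp) and Protein Length (517 aa) rows of the record.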
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.163977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTCTT TGAAATATAT AAGTCTGGCG TTATTACTCG TGGCCTGCGG TGGTCAAAGC CCTGCAACGA CCACGCCAGC AGCAACCACC AATTTACCAG CCCAAGCCAC CGCCACCATC CAAACTGGCT CAGCAGCAAC TACTATTCCA ACAACCGTGC CCCAAGCAAC GGGTATGCCG CAAGCAACGG TTGCTACGGC CAACCAAATT AATTTAAGTG CGCCGCCTGT GGATGCGGTG TTGACCGATT CGGTGCGCGT GGCTGGCACG ATCGTGCTTA CCCCATTCGA AAAAACCTTG CGCCTCGTGA TTCAAACCAA CGACGGCAAC ATTCTGTATG AGGGGCCAAT TAATACCACT GGCGAATATG GCAGCAGCGC AACCTTCGAT GTGACCGTGC CAATCGTCGC AGCAGCAAGC GGCCCAGGTG TGATCAAGGT GATCGAAGAT GATATGAGCG GTGAATTGCC CTATCGCACA ATTGCCGAGC AACCAGTCCA ATTTACTTCG ACTTCTGCCG AACCGACGCC TGCCGAGCCA GGAATTTTGA TCGAATTGAC TGAACCAGCG ATGCATGCGG TCGTTGGCAA TCCATTGAAT TTCAAAGGCA CGCTCTCGGC AATGCCCTTT GAAAAAAATG TCGTGATCGA AGTCTATGAT AGTGAATTGC ATTTGCTCGG TCAAACCAGC GTGATTGCCG ATGGCGAATA TGGCTCGGCT GGAACATTCA GCGGCAGTAT CAATTTTCAA GCCCCCTTGA GTAGCCGCAT CGGTCGGATT GTGGCCTATA CAACCTCGCC CAAGGATGGT TCAGTCGTTG GGCGTGACGA AGCAACTCTG ACGTTGCCTG CTTGGAATGG CACAGGCGCA TATTTGGCCC AACCTGCGCC CGAAACCAGT GCCTTCTTGC CTTTGCATGT TGAAGCCGTT GGTTTAAGCA GCGATACTTA CACTGTGCGT TTGCGCTACG CCGATGGCAC CCTGTTGGAA AACACTACCC AAGCCTACAA CGGGTATTTG GCCTTGAGTT TGATGTGGGA CAATGCTGCA CCCATTTTAC CCAACCAAAG CGCAATTTTA GAGTTGGTCA AGGCTGATGG CACAGTTGAG TTGACCCAAA ATCTGTATAT GCAAGATCTA ACCAGCCAAC CGACTACGAG TGTCGAAGTC TCTTGGTTAG CTGGCGAAGG CTCGATCAAT GGGATTCGGA TTTTACCAAA AACCTCAAGT GTCGCCAGCG CTGCCTTACG CGAGTTAGTT TGGGGTCCAG TTGGCAAAGA TTCAGCCTAT AGCACAGCGA TTCCTAGCCC TAAAATTATT GCCGATTACA CTGGTGATAA AACTGGCTGG ACTGGGCGGG TGCATCTGCG CTCAGTGCGG ATCGAAGGCG ATATCGCCTA CGTCGATTGG AGTCGCGAAA TGCGAGCATG GGGCGGTGGG TCAATGCAAC TTGAATCACT GCAAGCTCAA GTTGACCTAA CGCTCAAGCA ATTTTCTCAA GTCAAGCAGG TTGTTATGAC GGTTGAAGGC AGTGAAGAAG TGCTCCAACC ATAA
|
Protein sequence | MRSLKYISLA LLLVACGGQS PATTTPAATT NLPAQATATI QTGSAATTIP TTVPQATGMP QATVATANQI NLSAPPVDAV LTDSVRVAGT IVLTPFEKTL RLVIQTNDGN ILYEGPINTT GEYGSSATFD VTVPIVAAAS GPGVIKVIED DMSGELPYRT IAEQPVQFTS TSAEPTPAEP GILIELTEPA MHAVVGNPLN FKGTLSAMPF EKNVVIEVYD SELHLLGQTS VIADGEYGSA GTFSGSINFQ APLSSRIGRI VAYTTSPKDG SVVGRDEATL TLPAWNGTGA YLAQPAPETS AFLPLHVEAV GLSSDTYTVR LRYADGTLLE NTTQAYNGYL ALSLMWDNAA PILPNQSAIL ELVKADGTVE LTQNLYMQDL TSQPTTSVEV SWLAGEGSIN GIRILPKTSS VASAALRELV WGPVGKDSAY STAIPSPKII ADYTGDKTGW TGRVHLRSVR IEGDIAYVDW SREMRAWGGG SMQLESLQAQ VDLTLKQFSQ VKQVVMTVEG SEEVLQP
|
| |
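The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The helper names are hypothetical. Only the first three 10-base blocks of the gene sequence are used, to keep the example short.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
# Translation table 11 assigns the same amino acids as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons from the first base, stopping at a stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
prefix = "ATGCGTTCTT TGAAATATAT AAGTCTGGCG"
print(translate(prefix))  # MRSLKYISLA
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.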