Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3257 |
Symbol | |
ID | 5735125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4118878 |
End bp | 4119885 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280403 |
Product | hypothetical protein |
Protein accession | YP_001546022 |
Protein GI | 159899775 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00599549 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCAG ACGTAGTACA AAGCGATTAC GAACAGCTCG GTCAAGTGGC AACGCGCTTT CAAAAGCTGC ACGATCAACA AACTCAGATG GAAGCCATGT TGCGCCAAAC CTATCAGCAA TTGCGCACCG ATTGGAAGGG CGATGCCGCT GTGGCCTTTT TTGGCGAAAT GGATGACAGC ATTTTCCCCG CTCTCAAACG TTTACAAACC ACGCTGACCT CTGCTAGCGC CCTGACCACC CAAATCTCAC AAATATTTCG CCAAGCTGAA GAAGAAGCTG CTAAGGGCAT TCAGTTTGAT GGCGGTGGAG TGGCGGCTGG TGGCGGGGCT GGCGCTGGCG CTAGTTTTTC TGCTGAAGGT GGGTCATTTG ATGCTGGTGG CGGTGGTGGC GCTGGGCTTG ATCCAGCGAT TAGTGCGCGT TGGGCAACCC TTACCCCAGC AGAACAAGCG GCGGTGCTCC AAAGCATCTC CAACGAAATC TGCGATAAAT ATGGCATTGA GCATGTGCCA GTCACGGTTT CCGATCTGGC CGACCCACCA GGCCTAGATT TGTTCGGCTA TCGCAATGAT CAAGGGGTGT TTATCGACCT TGATAACATG GGTGATCCTG ATCGCGTTTT GAATACTGTG GCGCATGAAG TTCGCCATGA GGTTCAACGC CAAATGGCTA ATTTAGCCAA CCCCAGCAAC TTCGATAAGT TCTTGCGCTC AATTGGCATC CAAGCTGAAC CCACATGGCC GATGAACGAT GTCACTGAGG CGATGGCCAA CGAATGGCAC GAAAACTTCA ATAACTACAT CACCCCAGAA AGCGATTTTG CGGGCTATGA AAGCCAACCA TTGGAAACTG ATGCCCGTAA TTATGGCGAA ACCTATCGCG ATAATTTGAC CTTGACTGAT TTCGAACGCC ATATTCCAAC GCCAAAGCCT GAGCCATTGC CAGTGCCGAT ACCAGGCCCA AGCCCAACGC CACCAGCTAC AACGCCAACC CCACGCACAA CAAATTAG
|
Protein sequence | MAADVVQSDY EQLGQVATRF QKLHDQQTQM EAMLRQTYQQ LRTDWKGDAA VAFFGEMDDS IFPALKRLQT TLTSASALTT QISQIFRQAE EEAAKGIQFD GGGVAAGGGA GAGASFSAEG GSFDAGGGGG AGLDPAISAR WATLTPAEQA AVLQSISNEI CDKYGIEHVP VTVSDLADPP GLDLFGYRND QGVFIDLDNM GDPDRVLNTV AHEVRHEVQR QMANLANPSN FDKFLRSIGI QAEPTWPMND VTEAMANEWH ENFNNYITPE SDFAGYESQP LETDARNYGE TYRDNLTLTD FERHIPTPKP EPLPVPIPGP SPTPPATTPT PRTTN
|
| |