Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2626 |
Symbol | |
ID | 5734504 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3369286 |
End bp | 3370443 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279766 |
Product | hypothetical protein |
Protein accession | YP_001545392 |
Protein GI | 159899145 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000023102 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGTGT TAACAATGGT GTTAGCAGGT TGTGGTCAAG AAGATATCAC GGCTGAAAAC ATCGTTACCA AAATTCGGGA AGGCCAAGCC AATACCAACA ATGTTCATGC AATTATGCGC GTCGATTACA CCTCGGATCA AGCAACCGGC TTTATCAAGG CCGAAGTTTG GTCGAGCAAA ACTGGCAACA AAGATGCTGA AGGCAACGAA ATCAAAGCCT TCCGCGCTAA AATTCTTGAT GCCAACGATG CTAAGATGGT TGGCGCGGAA GTCGTGACTG ATGGTACTCA AGGCTGGTTC TACGACCCCG CTGAAAATAA AGCCTATGTT GGCACTGCCG CTGAGTTGAG CGAATCGCAA ATGGGCCAAG GCCAACAAGA TCAAGAGCAA GGCCAAGGCG GTCGCGGTCG CATGGATCAA ACCGAAATGC TCGGCCAATT GCAAGGTGTC GTTGAAAAAG GCTTGGATGC CTTAACAATC ACAATTTTGG GCGAAGAAGA AGTTGCTGGT TTCAATACCT ACAAAGTCAG CATTGCGCCA AAATCAGATG CCCAACAACA ATTGCCAATC GATTTGTTGG TTGACATCAA CGTTTGGTTG GAAGATACCA ATTGGATGCC AGTCAAGTTT GTGCTTGATG CCAAAGATAT GGGCAATGTC AGCTTTGCCG CTGAAACCCT TGAAGTCAAT AAAGGCATCG ATATGGCGAT TTTCAGCGCT CAGCCACCAG CTAACGCCGA AATCGTCAAG ATCGCCGATA TGATCAAGGA AATGGAACAA GCTGCTCCAG CTACCGATTT GACCTTGGAC GAAGCTAAGG CACAAGCAGG CTTCGAACTG TTGGCTCCCG GCGAAACCTT GGGCACAACC TTGAGCGATG TTCAATTGTT GACCCTTGCC AAAGCCACCA CCGTAATTCA AACCTACGCT GGCGCTGATG TGGAATGGAC CTTGGTTCAA GCCAAAGGCG ATGATCCAGA TGCTAAGATT CAACGTGGCG GCACTGAAGT AACCGTCCGT GGCGTTCAAG GCACCTTGAT TGAAGGCCGT GGCAATGTTG GTGTGGTGTT GACTTGGCAA GAAGGCGATG TGACCTTCGT GATTACTGGC AACATCGATG GCGAACAAGC CACCAAGATT GCCGAAAGCC TCAAGTAA
|
Protein sequence | MLVLTMVLAG CGQEDITAEN IVTKIREGQA NTNNVHAIMR VDYTSDQATG FIKAEVWSSK TGNKDAEGNE IKAFRAKILD ANDAKMVGAE VVTDGTQGWF YDPAENKAYV GTAAELSESQ MGQGQQDQEQ GQGGRGRMDQ TEMLGQLQGV VEKGLDALTI TILGEEEVAG FNTYKVSIAP KSDAQQQLPI DLLVDINVWL EDTNWMPVKF VLDAKDMGNV SFAAETLEVN KGIDMAIFSA QPPANAEIVK IADMIKEMEQ AAPATDLTLD EAKAQAGFEL LAPGETLGTT LSDVQLLTLA KATTVIQTYA GADVEWTLVQ AKGDDPDAKI QRGGTEVTVR GVQGTLIEGR GNVGVVLTWQ EGDVTFVITG NIDGEQATKI AESLK
|
| |