Gene Information |
Locus tag | Haur_3319 |
Symbol | |
ID | 5735189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4183106 |
End bp | 4184071 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641280466 |
Product | SH3 type 3 domain-containing protein |
Protein accession | YP_001546083 |
Protein GI | 159899836 |
COG category | [C] Energy production and conversion |
COG ID | [COG1413] FOG: HEAT repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATC AATTACCACC AACCCAGCCA CGCCAACCCC AACCATCGCA GGCGCGGCCC GAAGCAGCCC CACCACAAGT TGCCGATGCG CAATTACGTC AGTTGGTCGT AGCTTTAGAT AACGTCAACG ATCCCAATCA TGAACGGGCC GAGGATGATC TGATTCGTTT GGGTGCGCAG GCTGTGCCCT TGCTGATCGA TGCGCTTGAT CCACGTTACC CATGGTTACG AGCCTATCGC GCTGCCGAAG CTCTCGGCCA AATTGGCGAT GGCCGCGCCA GTCGTCCCTT GAGCCAAGCC TTAAATCATC CCAACAGCAA TGTGCGCTGG GCTGCCGTGC GAGCTTTGGG CAAAGTTGGT GATGGTCGCG CCTTGTTGGC CTTACGCCGC ACTGCCCGCG ATGATCGTAG CCGCACCAGT TGGGGCGAGC CTGTAGCTGC TAGTGCTGCT GCAACGTTGC GCGAAATGCA ACGTACCAGC ACAATCCTGC GACTTTCCGA TCCCATTCGG ATTGCCTTGT TGGTTGCGGT AGCCTTCTTT GCCTTGTTTT GGGCAAATGA TCGGATTACA ACAGTCCGCG GCGCAATCAA CGAGTCGAAT CATGTAGTTT GGGGCACTGC CGTTACGCCA ATTTTGCCAA CCGCAACCCC AGTCAGCGAA GATCTCAGTA GCGAAGATGA TCCTGAAGAG GATTTGGTTG AGGAAACGCC CACGGTTGAT CCGGCGGCGA CCCCAACCTT GGCTCCGGCA ACTGCCACAG TGGTTGCACC AACCGCTAAT GTTCGGCCAG CACCAAACAC CAATAACGAC CCAATCGCCC AGTTGAAAGC TGGCGATAGT GTGCAAGTGC TTGGTCAATC TGGCGATTGG TATGAAATAC AGTTGCCCGA TGGAACTGGC CGTGGTTGGG TGGCTTCAAG TGTGCTTGGC CCGCCAAGTG GGCCTGTGCC AACGGTTACT AATTAA
|
Protein sequence | MTDQLPPTQP RQPQPSQARP EAAPPQVADA QLRQLVVALD NVNDPNHERA EDDLIRLGAQ AVPLLIDALD PRYPWLRAYR AAEALGQIGD GRASRPLSQA LNHPNSNVRW AAVRALGKVG DGRALLALRR TARDDRSRTS WGEPVAASAA ATLREMQRTS TILRLSDPIR IALLVAVAFF ALFWANDRIT TVRGAINESN HVVWGTAVTP ILPTATPVSE DLSSEDDPEE DLVEETPTVD PAATPTLAPA TATVVAPTAN VRPAPNTNND PIAQLKAGDS VQVLGQSGDW YEIQLPDGTG RGWVASSVLG PPSGPVPTVT N
|
| |
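The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length (the gene sequence shown ends in TAA). The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 4183106
END_BP = 4184071
GENE_LENGTH_BP = 966
PROTEIN_LENGTH_AA = 321


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for this record under the stated assumptions.
    assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions the reported gene length (966 bp) agrees with the coordinates, and the protein length (321 aa) agrees with 322 codons minus the stop.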