Gene Information |
Locus tag | Haur_4546 |
Symbol | |
ID | 5736942 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5816525 |
End bp | 5817634 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281708 |
Product | DNA protecting protein DprA |
Protein accession | YP_001547305 |
Protein GI | 159901058 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
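The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Protein length for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(5816525, 5817634)
print(length)                     # 1110
print(protein_length_aa(length))  # 369
```

Both values agree with the Gene Length (1110 bp) and Protein Length (369 aa) fields in the record.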
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGAAC GTCATGCCTA CATTGCCTTT AATCTCACTC CTGGTATAGG ACCACAGCGC CTCCAAGCCC TGATTAAGCA TTGTGGCTCG GCGGCGGCAG CTTGGTCGGC TACGCTCGAC GATTGGCGGG CGGCGGGTTT GGATCGTCGC AGCATTCAAG CCTTGCAGCA TGCGCAACAA CATTTAGACC TTGAGGCCGA GCTGCGCACA ATTGCTGAAC AAAACATCAA GGTTGTGCTG CAAACTGATT CGGACTTTCC AGCCATGCTG CACACAATTG ATCCTGTGCC GCCGTTGCTG TATTTGCGTG GTTCACTGAT TGAAACTGAT CGTTGGGCGG TAGCAATTGT CGGTACGCGC AACCCTACCC CCTATGGTCG TGAGGTCACC TATAAGTTTG CTGGCGAGTT AGCCCGCGCA GGGTTGACGG TGGTTTCTGG TTTGGCCTTG GGCATCGATG CAATCGCCCA TCGCACCGCC TTAGATAATA ATGGGCGCAC CTTGGCGGTG CTTGGCAGCG GGCTGCAACA GATTTACCCT TCCCAACATC GCCAATTAGC GGCTGATGTG AGCCAACAGG GAGCCTTGCT TTCGGAGTAT GCCCCGACGA CCGAGCCATT GAGTGGCAAC TTCCCCGCGC GTAATCGCTT GATTAGCGGG CTAAGTTTGG CAACAATTGT AGTTGAAGCA GGCGAACGTA GCGGCGCATT AATTACTGCC CGCTTTGCGC TCGAACAAGG TCGTGATGTG TTTGCCGTGC CTGGCTCAAT TCTCAGTCAT AGCAGCGATG GACCAAATCA ATTAATCGTC GATGGTGCAA CACCCTTGCG TTCAGTCGAG CAATTGCTGG AGCAGCTGAA TCTGCATCAA GCGCAAGCCC AACAAACGGT CAGTACGATT GTGCCCGAAA CACCCGCTGA GGCCTTGCTC TTGCCCCATT TGAGTGGTCA GCCCACCCAC ATCGACGAAT TAGGGCGCTC GTGTGGGCTA GCGGCCCATG ATCTGGCGGC AACCTTGGGC TTGATGGAAC TCAAAGGCAT GGTTCGCCAT GTTGGTGGAA TGCATTATGT GCTTGCTCGC GAAACGCCTG CACCCTATGA TCTCTCATAA
|
Protein sequence | MDERHAYIAF NLTPGIGPQR LQALIKHCGS AAAAWSATLD DWRAAGLDRR SIQALQHAQQ HLDLEAELRT IAEQNIKVVL QTDSDFPAML HTIDPVPPLL YLRGSLIETD RWAVAIVGTR NPTPYGREVT YKFAGELARA GLTVVSGLAL GIDAIAHRTA LDNNGRTLAV LGSGLQQIYP SQHRQLAADV SQQGALLSEY APTTEPLSGN FPARNRLISG LSLATIVVEA GERSGALITA RFALEQGRDV FAVPGSILSH SSDGPNQLIV DGATPLRSVE QLLEQLNLHQ AQAQQTVSTI VPETPAEALL LPHLSGQPTH IDELGRSCGL AAHDLAATLG LMELKGMVRH VGGMHYVLAR ETPAPYDLS
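The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of how the record could be checked, the helpers below strip that spacing, compute a GC fraction, and translate a coding sequence. Two assumptions apply: the gene sequence is already shown in coding orientation (it begins with ATG), and a standard codon table is adequate, since NCBI translation table 11 uses the standard codon assignments and differs only in its permitted start codons. All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate("ATGGATGAAC GTCATGCCTA CATTGCCTTT"))  # MDERHAYIAF
```

The printed peptide matches the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content of 55%.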