Gene Haur_4546 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4546 
Symbol 
ID5736942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5816525 
End bp5817634 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content55% 
IMG OID641281708 
ProductDNA protecting protein DprA 
Protein accessionYP_001547305 
Protein GI159901058 
COG category[L] Replication, recombination and repair
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake 
TIGRFAM ID[TIGR00732] DNA protecting protein DprA 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAC GTCATGCCTA CATTGCCTTT AATCTCACTC CTGGTATAGG ACCACAGCGC 
CTCCAAGCCC TGATTAAGCA TTGTGGCTCG GCGGCGGCAG CTTGGTCGGC TACGCTCGAC
GATTGGCGGG CGGCGGGTTT GGATCGTCGC AGCATTCAAG CCTTGCAGCA TGCGCAACAA
CATTTAGACC TTGAGGCCGA GCTGCGCACA ATTGCTGAAC AAAACATCAA GGTTGTGCTG
CAAACTGATT CGGACTTTCC AGCCATGCTG CACACAATTG ATCCTGTGCC GCCGTTGCTG
TATTTGCGTG GTTCACTGAT TGAAACTGAT CGTTGGGCGG TAGCAATTGT CGGTACGCGC
AACCCTACCC CCTATGGTCG TGAGGTCACC TATAAGTTTG CTGGCGAGTT AGCCCGCGCA
GGGTTGACGG TGGTTTCTGG TTTGGCCTTG GGCATCGATG CAATCGCCCA TCGCACCGCC
TTAGATAATA ATGGGCGCAC CTTGGCGGTG CTTGGCAGCG GGCTGCAACA GATTTACCCT
TCCCAACATC GCCAATTAGC GGCTGATGTG AGCCAACAGG GAGCCTTGCT TTCGGAGTAT
GCCCCGACGA CCGAGCCATT GAGTGGCAAC TTCCCCGCGC GTAATCGCTT GATTAGCGGG
CTAAGTTTGG CAACAATTGT AGTTGAAGCA GGCGAACGTA GCGGCGCATT AATTACTGCC
CGCTTTGCGC TCGAACAAGG TCGTGATGTG TTTGCCGTGC CTGGCTCAAT TCTCAGTCAT
AGCAGCGATG GACCAAATCA ATTAATCGTC GATGGTGCAA CACCCTTGCG TTCAGTCGAG
CAATTGCTGG AGCAGCTGAA TCTGCATCAA GCGCAAGCCC AACAAACGGT CAGTACGATT
GTGCCCGAAA CACCCGCTGA GGCCTTGCTC TTGCCCCATT TGAGTGGTCA GCCCACCCAC
ATCGACGAAT TAGGGCGCTC GTGTGGGCTA GCGGCCCATG ATCTGGCGGC AACCTTGGGC
TTGATGGAAC TCAAAGGCAT GGTTCGCCAT GTTGGTGGAA TGCATTATGT GCTTGCTCGC
GAAACGCCTG CACCCTATGA TCTCTCATAA
 
Protein sequence
MDERHAYIAF NLTPGIGPQR LQALIKHCGS AAAAWSATLD DWRAAGLDRR SIQALQHAQQ 
HLDLEAELRT IAEQNIKVVL QTDSDFPAML HTIDPVPPLL YLRGSLIETD RWAVAIVGTR
NPTPYGREVT YKFAGELARA GLTVVSGLAL GIDAIAHRTA LDNNGRTLAV LGSGLQQIYP
SQHRQLAADV SQQGALLSEY APTTEPLSGN FPARNRLISG LSLATIVVEA GERSGALITA
RFALEQGRDV FAVPGSILSH SSDGPNQLIV DGATPLRSVE QLLEQLNLHQ AQAQQTVSTI
VPETPAEALL LPHLSGQPTH IDELGRSCGL AAHDLAATLG LMELKGMVRH VGGMHYVLAR
ETPAPYDLS