Gene Haur_5050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5050 
Symbol 
ID5737008 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp66651 
End bp67958 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content60% 
IMG OID641282215 
Productintegrase catalytic region 
Protein accessionYP_001547806 
Protein GI159901560 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCACGG ATCGCTGGTT TGCTGCTCGC CGAACGTTGT ACGAATTGCT CCACACGAAC 
CCCGACTGGT CGAATCGCCA GTTTGCCACC GCGCTCAACG TCTCCCCCGA TTGGGTTCGT
CTCTGGAAAC AGCGCATCGG TTCGCCACCC CATCCCGATC CTGATGTCGT CTGTCAGAGC
CAATCACGGG CACGCAAAAC ACCACCACCA GCGTGGAGCG ATCGCGTCAT TCACCGGATT
TTGACCTTGC GCCAGGAGTT GGCCGCGCAG TTCCATCGCA CGGTTGGAGC CAAAACCATT
CTGGCCTATC TCCAACGCGA TCCTGATCTC GCCGATGACC GGATTCCCCG TTCACCCACT
ACCGTGAATC GCATCTTGCG CGATCATCAG CTGCTCGTAG ACCCACCCAC GCATCAGCGC
CAACCCCGCA CCCCTTGTCC GCCCATGCAG GAGATTGAAA TCGATTTTAC CGATGTCACG
ACGATTCCGA CCAACCCCGA TGGCAAACGT CAGCACGCCG CCGAAGCCTT TATGTGGGTC
GATGCGGGCA CATCCATCCG CGTTGCCGCG CGGATTAGCA CCGATTTTCA TATGGCCTCG
GTGATCCGGA CGACCGCCAG TATTCTCCAG CAGATCGGGT TGCCTGCGCG GATTCGGATG
GATTGCGATG TGCGCTTGGT CAGCAACAAG CGCGTCGCCG ATTTCCCATC GCCCTTCCAA
CGCTTGTTGC TCAATCTCGG CATTCAGGTT GACGTGTGTC CACCCCATCG ACCCGACTTA
AAGCCGTTCG TGGAGCGGTT TCATAAAAAC TACAAGGGCG AATCGGTCTA TCCAAACTGG
CCGACGACCG AGGCCGAAGC CCAAGTCCAG GTCGATGCCT ATTGCGATTG GTATCGTACC
GAGCGCCCGC ACCAAGGCCG GGCCTGTGGC AATCGCCCGC CTGCCGAGGC GTTTCCAGAA
TTACCCGTGT TACCACCGGT TCCGGCGCAG GTCGATGCGG ATGGCTGGCT GAAGCAAATT
GACGGCTGGA CGTTTGTTCG GCGGGTCAAT GCGCAAGGCA AGCTCATGCT GGATGGCGCA
ACGTATACGG CGGGGATCGC CTATGCAGGG CAGGAATTGG CGGTGCAGGT GGATGCTGCC
GCGCGGGAAT TGGTGCTGAT CCAGCGTGAA CGCGCGGTCA AGCGGGTCAC GTTGAAGCGG
CTCTTGGGTG GGATGATGCC GTTTGAGCAG ATGGTTGAGG CATTGTGTGG CTTGGCTGCG
CAGGAAACCA AACGGCTCAA CCAACGCCAG CAGCGCCGCC GCCGATGA
 
Protein sequence
MVTDRWFAAR RTLYELLHTN PDWSNRQFAT ALNVSPDWVR LWKQRIGSPP HPDPDVVCQS 
QSRARKTPPP AWSDRVIHRI LTLRQELAAQ FHRTVGAKTI LAYLQRDPDL ADDRIPRSPT
TVNRILRDHQ LLVDPPTHQR QPRTPCPPMQ EIEIDFTDVT TIPTNPDGKR QHAAEAFMWV
DAGTSIRVAA RISTDFHMAS VIRTTASILQ QIGLPARIRM DCDVRLVSNK RVADFPSPFQ
RLLLNLGIQV DVCPPHRPDL KPFVERFHKN YKGESVYPNW PTTEAEAQVQ VDAYCDWYRT
ERPHQGRACG NRPPAEAFPE LPVLPPVPAQ VDADGWLKQI DGWTFVRRVN AQGKLMLDGA
TYTAGIAYAG QELAVQVDAA ARELVLIQRE RAVKRVTLKR LLGGMMPFEQ MVEALCGLAA
QETKRLNQRQ QRRRR