Gene Haur_0900 details

Gene Information

Locus tag: Haur_0900
Symbol:
ID: 5732801
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1031727
End bp: 1033388
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 56%
IMG OID: 641278032
Product: FHA domain-containing protein
Protein accession: YP_001543676
Protein GI: 159897429
COG category: [T] Signal transduction mechanisms
COG ID: [COG1716] FOG: FHA domain
TIGRFAM ID:
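
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1031727, 1033388)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length (1662 bp) and protein length (553 aa).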


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTGTC CAAGCTGTGG ACATACGAAT GATGGTGGCA ACCGTTTTTG TGAATATTGT 
GGTGCACGAC TCGATCCGTC GATGAATCAA GAAGCAACCC AAATCGGGGC AGTTCCCAAT
TTACATACGG ATCAAAGTTA CGATGCGCCA ACCATGTTTG TTCCAGCCGA TCAGGCTCCT
CCTGCGCCAC CAGCCCAAGC GGAGCCAGCA GTCGCCAGTG CGCCAGCCGC AAGCTCGCTC
AACTGCGCTG AATGTGGCTA TATCAATCAA CCAGGCGACC GCTATTGCGA TCAGTGTGGA
GCCTCACTCG ATGCCGCTCC TGTCGCTGTT GTTCCAGTGA CTACGCCTGC GCCAGTTGCC
AGTGAAGCGC TTACGCCGCC CGATGGTGTG CCAGCCGTAC CAGTCGCCGA GCCACATTTG
GCTGAATTAA CCAATGTTGC GCCAGCTGAG GAGTTGCCAA CCGTGCCGAT TGATGATCAA
CAGCCAGTTT CGACTCCTGT GGCCGAAGCC GAACCAGTTG CGCCAGCCAT TGAAGAACCA
GTGGTTGCAC CTGTAGCCGA GGCTGAACCA GTGGTGCCTG CGGTTGAAGA ACCAATTTCA
ACCCCTGTGG CCGAGGCCGA ACCAGTGGTT GCTCCGGTAG CTGAAGCCGC CCCAGCAGTT
GACGAAGAAG TCGTAGCAGC CGAACGCACT GCTTTATCAG CAGCGGTGAT TGAGCAAGAA
GATAATCTGG TGATGTTCGA GCAAATGGCT AATCGGTATG CTGGCCGCGC CTTGCCTGCG
CATATCGCCG CTGGCATCGA AGAAACCAAG GCTAGTCTGG CCGAAGCACA AGCTAATTTG
GCGGCGTTTG ATCAAGCCCA AGTTGTAGCC AAAGCTGCTG CTGAAGCCGC TGCTCAGGCT
GCCGCCGACG CTGCGGCTGC GGCTGCTGCT CAGCCCGATC CAGAAGAGGT AGCTCGTTTA
GAAGCAGCAA TCACCGAACA TCAAGATAAC TTGGCGATGT TTGAGCAGAT GTCAGCTCGT
TATGCTGGCC GTGCTTTGCC AGCCCATATC GCTGCTGGCT TGGAAGAAAG CAAACATGCC
TTGGCTGAGG CCGAAGCTGA ATTAGCCGCC TTGCTTGGTG GTGCACCCGC TGCACCTGCT
GCTGCGCCAA TTCCTTCAGC TCCGGTCAAT ACCTATGATG CGCCAACGGT TGCTGCGGCT
GCCCCAGCTG AACCAGCGCC AGCTCCGGTT GTGCCTGCCG AACCAGTGCC AGCTCCAGTA
GTTGAGGCTG TGCCAGCATG GGCTGCCCCA ACGCCTGCCG AGCCAGTCGC TGCTCCGATT
GCTCCACCAG CGCAAGTTAC CCCGCATTTG GTGGTTGCTG GCAGCCAAGT GGTGCTCAAC
TTGCCAACCG ATAAGCAAAT TTATGTGATT GGCCGTGAAG ATCCGATTAG CGGGATTTAT
CCTGAGGTCG ATTTGACCAA TCATGGCGGC GAAGGCGGTG GGGTCAGCCG TCAGCATGCC
CGCTTGCACA ATACTGGCGG CAATTGGACC TTGGAAGATT TGAATAGCAC CAACTATTCC
AAAGTCAACG GCCAAAAATT GGCTCCGCAT GCGCCAGCTC CGGTCAACCA TGGCGATCAA
CTGCAATTTG GCAAAGTTGT TGTGACTTTG CATTTGCATT AA
 
Protein sequence
MKCPSCGHTN DGGNRFCEYC GARLDPSMNQ EATQIGAVPN LHTDQSYDAP TMFVPADQAP 
PAPPAQAEPA VASAPAASSL NCAECGYINQ PGDRYCDQCG ASLDAAPVAV VPVTTPAPVA
SEALTPPDGV PAVPVAEPHL AELTNVAPAE ELPTVPIDDQ QPVSTPVAEA EPVAPAIEEP
VVAPVAEAEP VVPAVEEPIS TPVAEAEPVV APVAEAAPAV DEEVVAAERT ALSAAVIEQE
DNLVMFEQMA NRYAGRALPA HIAAGIEETK ASLAEAQANL AAFDQAQVVA KAAAEAAAQA
AADAAAAAAA QPDPEEVARL EAAITEHQDN LAMFEQMSAR YAGRALPAHI AAGLEESKHA
LAEAEAELAA LLGGAPAAPA AAPIPSAPVN TYDAPTVAAA APAEPAPAPV VPAEPVPAPV
VEAVPAWAAP TPAEPVAAPI APPAQVTPHL VVAGSQVVLN LPTDKQIYVI GREDPISGIY
PEVDLTNHGG EGGGVSRQHA RLHNTGGNWT LEDLNSTNYS KVNGQKLAPH APAPVNHGDQ
LQFGKVVVTL HLH
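
The sequences above are printed in blocks of 10 letters separated by spaces. The following is a minimal sketch of how such a listing could be joined into one contiguous string and its GC fraction computed. The helper names are hypothetical, and the example uses only the first line of the gene sequence, so its result need not match the 56% reported for the whole gene.

```python
def join_sequence_blocks(text: str) -> str:
    """Remove all whitespace (block separators and line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAAGTGTC CAAGCTGTGG ACATACGAAT GATGGTGGCA ACCGTTTTTG TGAATATTGT"
seq = join_sequence_blocks(first_line)
print(len(seq), round(gc_fraction(seq), 3))
```

Passing the full gene listing to join_sequence_blocks in the same way should give a string whose length can be compared with the Gene Length field.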