Gene Haur_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2549 
Symbol 
ID5734427 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3276091 
End bp3277719 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content46% 
IMG OID641279689 
Producthypothetical protein 
Protein accessionYP_001545315 
Protein GI159899068 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.170922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACATG CAACACGGAT TGGGCTAGTA ACCAGTGGCT CGTTATTGGA AGGCTTAACT 
GCCCGTTTAG ATGAACGCTA CGAAATTGAG CGTTTGCGCG TTGGTCAATT TATGGTGGTG
CAAGGCCGCC AAAACCGCTT TTTTTCAATG TTGACCGATG TGCAACTGGC CGCTACCAGC
CTTTCGATTT TGGCCGATCC GCCAGATGAT GAGCACCCGT TGTTGCGTGA GATTTTGGCT
GGGCGTAACA CCTATGGCAC ATTTAAATTA ACCCCTCAAT TGATGTTGCC CGAAGATTCA
CTCGAAACAC CTCGCCCAGT TAAGACTATT CCTGCCCATT TTGCGCCAAT TTACGAGGCC
AGCGAAGATG ATTTTGGCTT GGTGTTTGGG GCTGAGGGCG ATGGTAAGTT TCAAATGGGC
ACGCCGCTGG ATATGGATGT ACCAGTCTGT ATCGATCTTG AGCGCTTTGT TGAGCGCTCG
AATGGTGTGT TTGGTAAATC GGGCACAGGT AAATCATTCT TAACGCGTTT ATTATTATGC
GGCGTGATTA AACATAATGC TGCGAGTAAT TTGATTTTTG ATATGCACTC CGAATATGGT
TGGAGCGGCA CAACCGAGGA TAAGATTCAA GAAGTTAAGG GCTTAGCGCA GCTTTTTCCT
GGCCAAGTCT ATATCTACAC GCTTGATCCT GAGTCGTCGC GGCGGCGCGG AGTAAAATAC
GATGGTGATA TTACGATTGG CCTGAATGAA ATTCGGGTTG ATGATATTTT ATTATTGCAA
GATGCGCTTA ATCTCAATCC GACTGCGGCA GAATCGGCCT TTATTTGTGC TCAGCGTTTT
GGCGACGATT GGATTCAAAA ACTGCGTGAA TTAGATACCG AACAACTCAA AGAGTTTGTC
GAATCGACAG GCGCAAATAT GTCGTCGATG TCAGCACTTT CGCGCAAACT AGCTCAGCTT
GAGCAACTCA AATTTGTCAC TCGCAAATCG AGCCAATCAT CAATTCGTCA AATTATTGAT
GCCTTGTTGG CTGGTAAAAA TGTAGTGGTA GAGTTTGGTC AATATCGCAG TGAATTGGCC
TATATGTTGG TTTCAAATAT TCTGACGCGC CTGATTTATG ATGAATGGGT ACGGCGTACC
GAAACCTTTC TAGCCACAAA AAAATCGAGC GATAAACCGC CGCAACTGAT GATTACGATT
GAAGAAGCGC ATAATTTTCT TACGCCCAGC CTGGCCAAAC AAACCATTTT TGGCAAAATT
GCCCGCGAAT TACGCAAATA TTCGGTCACG TTGTTGGTGG TTGATCAACG GCCATCGTCG
ATTGATAACG AAGTGATGAG CCAACTTGGC TCGCGGATTA CTGCTTTGCT TAACGATGAT
CGTGATATTG ATGCGGTATT TATGGGTGTT GGTGGCTCGA AAGGCTTGAA AACCGTGTTG
GCCTCGCTTG ATTCGCGCCA GCAAGCCATG ATGTTGGGCC ATGCCGTGCC GATGCCAGTG
GTGATGCGTA CCCGAGCCTA TGATAAAGCT TTTTATGAAG CGATGATGCA GGGTAATCGA
CGACGGCCTA AGCCTATTCC AATTACCGAT GACGATGCTA ATGATGATTT ATTTGGATCA
AGGCAATAA
 
Protein sequence
MAHATRIGLV TSGSLLEGLT ARLDERYEIE RLRVGQFMVV QGRQNRFFSM LTDVQLAATS 
LSILADPPDD EHPLLREILA GRNTYGTFKL TPQLMLPEDS LETPRPVKTI PAHFAPIYEA
SEDDFGLVFG AEGDGKFQMG TPLDMDVPVC IDLERFVERS NGVFGKSGTG KSFLTRLLLC
GVIKHNAASN LIFDMHSEYG WSGTTEDKIQ EVKGLAQLFP GQVYIYTLDP ESSRRRGVKY
DGDITIGLNE IRVDDILLLQ DALNLNPTAA ESAFICAQRF GDDWIQKLRE LDTEQLKEFV
ESTGANMSSM SALSRKLAQL EQLKFVTRKS SQSSIRQIID ALLAGKNVVV EFGQYRSELA
YMLVSNILTR LIYDEWVRRT ETFLATKKSS DKPPQLMITI EEAHNFLTPS LAKQTIFGKI
ARELRKYSVT LLVVDQRPSS IDNEVMSQLG SRITALLNDD RDIDAVFMGV GGSKGLKTVL
ASLDSRQQAM MLGHAVPMPV VMRTRAYDKA FYEAMMQGNR RRPKPIPITD DDANDDLFGS
RQ