Gene Haur_3149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3149 
Symbol 
ID5735021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3976873 
End bp3978468 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content49% 
IMG OID641280292 
ProductATPase-like 
Protein accessionYP_001545914 
Protein GI159899667 
COG category[R] General function prediction only 
COG ID[COG3903] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACATC CTTGGCCAAA TAAAATGCTC CGCCAATTAC TCCGCGGCGA GATGTTATCG 
CAAACGTTGC AGCAGTGGGT CGATAATCAT GGTGGTCGGC AGGTAGTTGT GCAAACGTTG
CGCGATGGCA TGAGCCACGA TGTTGATGCA CTTGCCTTCT TTGAACAGCA GGTAAATCGT
TATACCACAG AAAGCCAAAA AGCTGCAAAT CTTAATATGG CCGATAGCAC CTACTATGCC
AAACGTGGCC GTTTTTATGA GCAATTACGC CATTTGCTCG ATCATCTGCC TAGCCTGCGT
CCTACCACGA TCATTGGTCT GCCGATTCCG CTTACGCCGT TGGTTGGCCG CCAAGCAGTG
TTGCAAGAGT TGCTTGAGCT TTGCCGCGAA TATCGCCTAG TAACCTTGCA TGGCATTGGT
GGGATCGGCA AAACCCGCCT CGCGATTGCC CTCGCTAGCT ATATTGCTAA TGCTGGCTTT
GCCCAAGAAG TCGTTTTTAT CGATTTACGC AATGAATATA CGGTGCATGA TAGCTGGCAT
GCCTTGCTTA ATCGCTGGCT CGGCGACCCG AAAGCCGATT TAACCAGTTA TATTCAGCAA
TCCAATCGCC GCACGGTCTT GATTATCGAT AATTGTGAGC ATATTCGGGC AGTTGCTGAG
ATGCTGCTGC CATTGCTCAA TTTTGGCAAT ATTTCGATTA TTACCACCAC GCAAATTGCC
TTGAGCATCA ATGGCGAACG ACGCTTTCCC GTGCCAGCCT TGAGCCTCGA AGAAGGCATT
TTGTTGTTTG AGCAACGTGC TCGCGATTTG AACCGTCAGG TTGAGCGCCG TCAAACTGAG
CAAATTGTCC AGCGGTTGGC GGGGCATCCC TTGGCGATTG AAATTGCCGC TTCACAATTG
TTGTTGGTTT CGCTCAACGA TATTTTGGCC ATGACCAACG TCGAAATGCT GGAGATCGAA
TCGTTGGCGA ATGGCTCGGC CTCGCATCGC ACCTTGCGCC AAATGGTTGA GTATACGTTT
TCGTTGTTGC ATGATGATGT TCAAGCTGCT TGCCTCCGCT TAGCACTATT TGAACATCAT
TTTAGCTTGG CCCAAGCAAC CCAAGCCTTT AAAGTCAATT GGCGGGTGGC TAGCGGTTTG
GTTGATGCAT CGTGTTTAGA GGGCAGTGTC GATAGCCGTG GCGAAACGCG GTTTAGTATG
CCAATTGTGA TTAGGCTCTA CGCTCGGCAA GTAGCGCTCG AACGCGATGT CTATCATCAA
TTGATGCTGG AATTTACCCA ATATTGGGCC AATGAAATTA AGCAGATTTT GCAACGCTGG
GAGCAAACGA TCGAACGCGC CAAAGATCAT GGCGAATTGC TCACGCCGAT GAACGATCTT
TCGTTGTTGC GCGAGAGCTA CACAACGATT CGTGGCTGTT TGCAATGGGC GGCGATCTAT
GAGCATACGC AATTATTGGA GTTAGTCGGC GGTTTGTGGA AGTTTTGGCT GCATTATGAA
AGCCCCGAGG GTCGTTTATG GCTGCGTTCG GCCATGAGTA TGGCGAGCGA TCAGCAACGC
AGCGAATTAC AACAAATTCT ATTGCTGTTT AGCTGA
 
Protein sequence
MPHPWPNKML RQLLRGEMLS QTLQQWVDNH GGRQVVVQTL RDGMSHDVDA LAFFEQQVNR 
YTTESQKAAN LNMADSTYYA KRGRFYEQLR HLLDHLPSLR PTTIIGLPIP LTPLVGRQAV
LQELLELCRE YRLVTLHGIG GIGKTRLAIA LASYIANAGF AQEVVFIDLR NEYTVHDSWH
ALLNRWLGDP KADLTSYIQQ SNRRTVLIID NCEHIRAVAE MLLPLLNFGN ISIITTTQIA
LSINGERRFP VPALSLEEGI LLFEQRARDL NRQVERRQTE QIVQRLAGHP LAIEIAASQL
LLVSLNDILA MTNVEMLEIE SLANGSASHR TLRQMVEYTF SLLHDDVQAA CLRLALFEHH
FSLAQATQAF KVNWRVASGL VDASCLEGSV DSRGETRFSM PIVIRLYARQ VALERDVYHQ
LMLEFTQYWA NEIKQILQRW EQTIERAKDH GELLTPMNDL SLLRESYTTI RGCLQWAAIY
EHTQLLELVG GLWKFWLHYE SPEGRLWLRS AMSMASDQQR SELQQILLLF S