Gene Haur_5107 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5107 
Symbol 
ID5737065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp141871 
End bp144846 
Gene Length2976 bp 
Protein Length991 aa 
Translation table11 
GC content63% 
IMG OID641282272 
Producthypothetical protein 
Protein accessionYP_001547863 
Protein GI159901617 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.286492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATGGAC GCTTAATCAT GACGGTCTAT GGCCGCACGC TGGCTGGAAT TGGCGTGGTG 
CTGCTGGTGC TGGCGCGGCT GCTTAGTCCG TGGCTGCTCG GCCATCTCAC CAGCCCACGC
CTTGCCCCGT TGGCCACGCT GGTCACACGC ATACATGCGA TCAGTGGGTG GCTGCTGGTG
GGCTACGGGC TGTGGGTGCT TGTCACGCTC GTCGCATGGC TGCGGTTGCG GCGACGGCCT
GCGGTGACCG GCCAGTGGTA TCTGCTGCGC ATTCCCAAGC CCATGACGAC CGACCCGCAT
AGTGCCACGG CGGCCCAGCT TGGCCAACGT GAGGCGGCGA TGGCGGGCGA GATCGTGCGG
GTCTTGCAAG CGATTGTCGC GCACGCCCCT GCCGATCGGC GGATTGCCCT CGAACTCTGG
TACACCGCGC AGGGGATTGC CTGGAGCCTC TGGCTCTCGC ATCCCGATTT GTATGCGCCC
GTGTTCAGTG CGATGCAGGG CTTTGTGCCC GGCGGCGATC TGCTGGCCCA GCCCGATCCG
GTTGCCACCG CTGCTCCCGC GCTGGAGTGG ACGCAGCACT CCCTTGCGCA GCCTGCGATC
TATCCGTTAC GCCGTGCCGA TGCGATGGTT GGCGACCCGC TCGATGCCCT GATTGGGGGC
TTGCGACCGC AACAGGGCGT GACCGCGATT GGCCTATCAC TGACCCTTGG AGCCGTGCCT
GATGGCTGGC AGCGCCAAAA CCGTGCGCTG GCGGATTGGC TCGCCCTTGA TGCCAAGGAG
CAGCAGGCCA GTGCCAGCAA AGAGACCGCT GTCGCGATGT TGGCCAAGCT GCAAAGCCCA
TGCGTGACCA TTGCGCTGCG CACCGTCGTG CTGGCCGAGA GCGCCGCACT CGCCACCGCC
CAACACCAGA TCCTGCTCGG CGCACTCGAC ACCTATACCA TGCAGCACGA CTCCATGCGT
CAAGCCTGGA CCCATGGCCG TATCCACACC ACGATCACCC CCGCCGTGCG CGACCGCCAT
ACCGTGCGCT GGTTGGCCAT GCCGTTGCCG CCGCTCGTAC CAACTCCCAC GCTGCCCGTG
CTGAGCCTGC CCGAAGTCAC GGCGCTCTGG CATCTGCCAA CGGGTCGCCA TTTGACCGTG
GCAATCTGGC GCAATAATCG CTTTGCCCCG CCTAGCCCGG CCTTGATGGT CAGTCCCATC
GGGGAGCCAC CCCCAGCGTT TGCTGCGACC ACTCCCTTAA AGGATCGCAT CGTGCGGATC
ACGGGCGGGG TCGCAGCGGA TGGCCAACCC GTCTATTTGG GCTTTCCGCT CAAAAGCTAC
ACCTTCCATC AGCAAATTAC CGCCTCAACG GGGGCGGGTA AAAGTCACGC GGCCAAGGTG
CAATTGGGTG AATTGCTGCG GATTGGGGCA GGCTTTGGCG TGATCGACTT CAAGGGGGAT
TTGGTCAATG ACCTGTTGAC CATGATCCCC GATGATCGGC TCGATGATGT GATCGTCTTT
GACCCGCTTG ACCCTGACCA CTGCCTTGGA TTGAACCTGC TCGATCCCAC CTATTTAAAT
AAGGATACCG AACCGGATTT TTTGGTCGAA TTGCTCGAAG CCTTGGTGGC GAGTACGGAT
AGCCACTGGG GTGATTCGGC GGGGATGCAG GAAGCCCTGC GCTATGGGGC CTTGACCCTG
ATGGAAGGCG AGCCAACGCC GACGATTGCC CACTTGTATT TGCTCTTTTC CAGTAGTGCC
TACCGCGCGA CCGTCCTAGA GCGGGTGACT GACCCCGACT TGGTGTTGTT CTGGACCTAT
CAATTTCCCA AGCTATCGGA CACCCAAAAA TCCTCAATGA CCGCGCTCCA GCGGCGCTTG
AGTCAGTTTT TGATCAATCG CACGGTGCGG ATTACCTGCA ACCAGACCCG CACCACCATT
CCCTTCCGGC AGGTGATGGA TCAGCGCAAG ATTGTGCTGG CCAAGTTGCC CGTCGAATTG
ATTGGGGCCA CCGCTGGCGG GATTCTGGCC AATGTGCTGA TTAACCTCGT GTTGGCGGCG
GCGTTTAGTC GCTTGGATAC GCCCGAGGAC GAACGCGAGC CGTGGGTGCT GGTCGTCGAT
GAGTTTCAGG AGGCCATGCG GCGCGGCGAT CCGGCCAATT ACCAGAAGAT TCTGGAACGC
TTACGCTCGT TTGGGATTGG CCTGGTCTTG ATGCACCAAG GCACGAGCCA ACTGCCTGCG
GAGATGTTGG CCACGACGCT GGAGATTGTG CAGACGCGCT TGATCCTCTC GGCCTTTGGC
CCCGATGCCG CCGTCTGGCA GCGCCAATAT CCCGATGCGC GGCTGACCAT GCAGGACTGG
GCCGGGATTC CCCTCCGCGA CGAAGGCTAT GCGGCGATCT CGATCAATGG CTTTCGCCAA
CCAATCTGCA CGATCACCCC GCTGCCGCTC TGGCCTGCCC TGCCGCAAGG GAGAACAACC
AGAACCCTGA CCACGCTGGC TTCCCCACCG CGTGATCCCC TCGACGAACG GCTGCGGGCA
CTGGCCCAGT TTGATCGTGC GACGCGGCTG CGCTTGCTCC GCGATGCCCC ACCCGCGCTG
TGGGAGGCCA TCCTCGCCCG CATGGAGCAG GATCGGGCTG ACCGCCATGC CCAGCTGCTC
GCGCCGACCT GTCCCTTGCC GCGTGCCGCC CGCGTGCTGG CCTTGAGCCA TGCCAAAGCC
GCAACCCCAC GGGATCTGGC GCTCGCCGCC TTAACCCGCC TGACCCTGAG TATTCCGCGT
GATACGGTGG ACGACGAACC GCCTGCCAAG GGCAAACGCG GACGCAGTGG CGCGGCGGAA
TCTCCATCGG AAGCGGCTCC CGCAGGGAGT CCCGTGGCAA CGCCAGCAGT GGTGGTTAAT
CCGCGCAAGG TCGGCGTGGC CACCGTGATG CAGGCCGAGG CGACGGCCAC CAATCTCATG
CACTTGTTAG CGGAGAATCC TGATGCGGGT TTATAA
 
Protein sequence
MHGRLIMTVY GRTLAGIGVV LLVLARLLSP WLLGHLTSPR LAPLATLVTR IHAISGWLLV 
GYGLWVLVTL VAWLRLRRRP AVTGQWYLLR IPKPMTTDPH SATAAQLGQR EAAMAGEIVR
VLQAIVAHAP ADRRIALELW YTAQGIAWSL WLSHPDLYAP VFSAMQGFVP GGDLLAQPDP
VATAAPALEW TQHSLAQPAI YPLRRADAMV GDPLDALIGG LRPQQGVTAI GLSLTLGAVP
DGWQRQNRAL ADWLALDAKE QQASASKETA VAMLAKLQSP CVTIALRTVV LAESAALATA
QHQILLGALD TYTMQHDSMR QAWTHGRIHT TITPAVRDRH TVRWLAMPLP PLVPTPTLPV
LSLPEVTALW HLPTGRHLTV AIWRNNRFAP PSPALMVSPI GEPPPAFAAT TPLKDRIVRI
TGGVAADGQP VYLGFPLKSY TFHQQITAST GAGKSHAAKV QLGELLRIGA GFGVIDFKGD
LVNDLLTMIP DDRLDDVIVF DPLDPDHCLG LNLLDPTYLN KDTEPDFLVE LLEALVASTD
SHWGDSAGMQ EALRYGALTL MEGEPTPTIA HLYLLFSSSA YRATVLERVT DPDLVLFWTY
QFPKLSDTQK SSMTALQRRL SQFLINRTVR ITCNQTRTTI PFRQVMDQRK IVLAKLPVEL
IGATAGGILA NVLINLVLAA AFSRLDTPED EREPWVLVVD EFQEAMRRGD PANYQKILER
LRSFGIGLVL MHQGTSQLPA EMLATTLEIV QTRLILSAFG PDAAVWQRQY PDARLTMQDW
AGIPLRDEGY AAISINGFRQ PICTITPLPL WPALPQGRTT RTLTTLASPP RDPLDERLRA
LAQFDRATRL RLLRDAPPAL WEAILARMEQ DRADRHAQLL APTCPLPRAA RVLALSHAKA
ATPRDLALAA LTRLTLSIPR DTVDDEPPAK GKRGRSGAAE SPSEAAPAGS PVATPAVVVN
PRKVGVATVM QAEATATNLM HLLAENPDAG L