Gene Haur_0190 details

Gene Information

Locus tag: Haur_0190
Symbol:
ID: 5732036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 218069
End bp: 222409
Gene Length: 4341 bp
Protein Length: 1446 aa
Translation table: 11
GC content: 58%
IMG OID: 641277314
Product: hypothetical protein
Protein accession: YP_001542970
Protein GI: 159896723
COG category:
COG ID:
TIGRFAM ID:
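
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 218069
end_bp = 222409
gene_length_bp = 4341
protein_length_aa = 1446

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus one untranslated stop codon.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)
```

Under those assumptions the computed span should equal the listed gene length, leave no remainder when divided into codons, and give the listed protein length.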


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTGTC CTCGCTGTGG CACGCCCAAT ACTCCTGATC GCCAATTTTG TGGGCAATGT 
CAAGCGCCAT TGGCTGTAAC GCCAAGCGCT AGCAATGATC TACCACCATG GTTGCAAGAT
CTCGATCAGC AACAAACAGT TGTGCCTGCT AAACGTGGCA CATCGAGCCT ACCACCGTGG
TTGCAAGAAG CAAGCCCACC GCCGCCAGTT GCTGCGAATG CCGAGAGCTT GCCGCCTTGG
TTACAAGAAT CGGCAGCGCC AATCCCCGCG CCGAATGTTG CACCAGCACC CGCCAATAGC
GAAGAGCTTC CACCATGGCT CCGTGATTTG CAAGCAACTG CTCCTGCCTT GCCTGCTGAC
GAACCATTGC CAAATGCTCC ACGCTCCGAA CAACCCTTGC CTAATTGGTT ATCCGATTTG
CAGTCAGGTT CAACTCCGCC GCCACCCACA CCAGTGGTCA ACGCTGATGT GCCGTCGTGG
TTGCAAGGCT CGGTCGAGAC AACCCCACCA GCGCCTAAGC CCAGTGCCAA CGCTGATGTG
CCATCGTGGT TGCAAGAGCC AGCGTTGCCC GCCGCTCCCG CCCCAACGCC AGTAGCGCCA
AGCGCTGATG TGCCGTCGTG GTTGCAAGGC TCGGTCGAGA CCGCCCCACC AGCGCCTAAG
CCCAGTGCCA ACGCCGATGT GCCATCGTGG TTGCAAGAGC CAGCGTTGCC CGCCGCTCCT
GCCCCAACGC CAGTAGCGCC AAGCGCTGAT GTGCCGTCGT GGTTGCAAGA ACCAGCATTG
CCCGCCGACC CTGCCCCAAC ACCAGCAGCA CCAAGTGCTG ATGTGCCGTC GTGGTTGCAA
GGCTCGGTCG AGACGACCCC ACCAGCTCCC AAGCCCAGTG CCAACGCCGA TGAAGAGCCT
ATGCCAGCAT GGTTGCAGCA GTTGCGACCG AGTGAACCAG CACCACCACC AGGTATTGCC
TCGTTTGTGG AAGATAATGA AGAACCTGTG ATTGCGCGAC GTGGTGGGAC TGGCAATTTG
CCCGATTGGC TCAAAGATTT CGATACCGAG CCACCTGTGG TGCAATTATC AGATGTCGAT
CTCGATGCAG CTGGGCCAAG CGATGATGTG CCACCTTGGC TCAAGCCTGC AACAGCTCCA
TTAACTTCGC CAGCAGCCCC AGCGGCAAGT AGCGGAGTGC CAGCGTGGAT GCAAGCCGAT
TCCGCCCCAC CAGCACCGCC AGCGGCTCCG GCAGCAAGCG CAGGGGATGA TGTGCCGTCG
TGGTTGCGCG GGGATTTAGA TGTCGTGCCG CCAGCGGCTC CGGCAGCGAG TGGCGATGTG
CCGTCGTGGT TGCAAGCTGA ATCAGCACCA CCAGCGGTTC CAGCAGCAAG TGCAGGCGAC
GATGTGCCAG CGTGGTTGCG CGGGGATTTA GATGTCGCGC CGCCAGCGGC TCCGGCAGCG
AGTGGCGATG TGCCGTCGTG GTTGCAAGCT GAATCAGCAC CACCAGCGGC TCCAGCAACA
AGTGCAGGTG ACGATGTGCC AGCGTGGTTG CGCGGGGATT TAGATGTCGC GCCGCTAGTG
GCTCCAGCTG TGAGTGCAGG GGATGATGTG CCAGCGTGGA TGCAAGCCGA TTCCGCCCCA
CTAGCCCCGC CAGCAGCAAG CACAGGCGAC GATGTGCCAG CATGGTTGCG CGAGGATTTG
GATGTCGCAG CGCCAGCCGC CCCAGCAGCA AGTGGCGATG TGCCCGCGTG GATGCAAGCC
GACGCAGCGC CGCCAGCCCC GCCAGCAGCA AGCGCAGGCG ACGATGTACC AGCATGGTTA
CGCGGGGATT TGGATGTCGC AGCTCCGCCA GCAGCTCCGG CTGCGAGTGC AGGGGATGAT
GTGCCAGCGT GGTTACGCGG GGATTTAGAT GTCGCGCCAG CCGCCCCAGC AGCAAGTGGC
GATGTGCCCG CGTGGATGCA AGCCGACGCA GCGCCGCCAG TCCCGCCAGC AGCAAGCGCA
GGCGACGATG TGCCAGCGTG GTTACGCGGG GATTTAGATG TCGCAGCACC GCCAGCGGCT
CCGGCTGCGA GTGCAGGGGA TGATGTGCCA GCGTGGTTAC GCGGGGATTT AGATGTCGCA
GCACCGCCAG CGGCTCCGGC TGTGAGTGCA GGGGATGATG TGCCAGCGTG GTTACGCGGG
GATTTAGATG TCGCAGCACC GCCAGCGGCT CCGGCTGCGA GTGCAGGGGA TGATGTGCCA
GCGTGGTTAC GCGGGGAGTT GGATGTCGCA GCGCCAGCAG CAAGTGGCGA TGTGCCCGCA
TGGTTGCAAG CCGACGTAGC GCCGCCAGCC GCTCCGACTG CAAGTGGTGG TGATATGCCT
GCGTGGCTGC AAGCGGATGC AGGTGAAACT GTGCGGCTTG ATTCCGCAGA TGCTAGCGCT
GATGTTCCGG CATGGCTCAA AGCTGATTTA GAAGCTGCAC CACCAGCAGC TCCAGCGGCG
AGTGGCGATA TTCCATCATG GTTGCAAGCG GATGCAGGTG AAACTGTGCG GCTTGATTCC
GCAGATGCTA GCGCTGATGT TCCGGCGTGG CTCAAAGCTG ATTTAGAAGC TGCACCGCCA
GCCGCTTCGG CAGCGAGTGG TGGCGATATT CCATCATGGC TGCAAGCGGA TGCAGGTGAA
TTGCCCCAAG TTAGCCCGAC TGTGCGGCTT GATCCTGAGG ATTCCAGCGA TAATGTTCCG
GCATGGCTCA AAGCTGACTT GGAAACGGCC TCACCAGCGG CTCCTGCGCC TATTGATACT
CCCAATTGGT TGCATGAAGA TGCTCCAACG GTAAAACTCG ACAATCAAGC TGCCGGTATT
CCTGCATGGT TACAAGAAGA TCTGCCAACC GTTAAATTGG ATAGCCCTGC AGGGAATATT
CCGGCTTGGC TACAAGATGA TGTAACGCCG ATTGCGCCAG CTCCAGTCAA ACCTGAGCCT
GTTGCGCCTG CGACTCCAAG CTACGATCCA GAGGGTGCAA ATGTCCCGGC ATGGTTGCGA
GACGACTTAG ATATTGAGAT TCCAGCGAGC AAGCCCACCG TAGCCCCAAT TATTCCGAGC
AGTACCTCGC CTTCATGGCT ATTGGATGAT GAGCCAACCC AAGCGACTCA AGATGATACC
TTGTTGGGTA GCGTTGATTT GCCTGCGTGG TTACGCCAAA CGGTTAAGTT AGAGGAACCC
AGCCTAGTTG TTGAACAAGA AGCTGCTTCG GCTGTGGCTG AATCAGCCGA TTGGTTACGG
GTGTTGGGTG AGCCAGAACC TGCGATTGCC GCAACCAGCC CAACCACTCG CCGTTTGATT
GATAGTGAAC CGCCAGCCTT GCTTAGCCGA ACGCCCGAGC GGGTTTCGGC AATGCATTTG
TTGCAAGATT TGGTGACCAA GCCGTTACCT GAGCCAGTCG AAGCACCACC TGTAGCGCTT
GCGCCTTGGT GGCAGCGCAT TGGCACTCAG CGGATTGTCG CCAGTTTGAT GATTATAAGT
TTGCTGGTTG GCTTGCTTGT GCCCAATCTC TTGGCGATCA ATACTCAAGC TTTAACCATT
GGCGGCAATA CCAGTGATCT CTACAACTAT GTGGAAACAC TCAACCCCGA AAGCCGCGTG
CTGATTGCCT ACGAAGGTGA TTTACGCCAT AACGCTGAGC TTGGGCCGTT GGAACACGCA
ATTTCCCAAC ATCTGATCGA GCGTAAAGTG CCGATGTTAT TGCTTTCGAC TGATACTGAA
GGGAGTTTGC TGGCAAGCCA ACGTGCTGCC CAATTTCCAG TGATCGATGC TAACACAGGC
TATACTGGCG AAGGTGCTGA ATACTTGAAT CTTGGTTGGG TCAAAGGCAA TGAACTTGGA
ATTGCCCAAC TTGGCAGTAA TTTGCGTGGC GCAATTGCCA ACCTTTTGCT CAATCGCGAA
GGTGCTGATG TTAATTTGTT GCGAATTATG TCGAAATTGA ATTCCAACAA TCAGCTTGAT
CGCCCACGGA TTACCACGAC CAAAGACCTT GATTTGTTGA TTGTGGTTGC TGATGAGCCA
GGCGATGTCC AGCGCTGGAT GGAGCAATTC TGGGCGCGTG AACCAGCCCT GCCAGTTGCC
ATTTTAACCA CCAATGAAGT CTTACCCCAA ATCCAACCCT ATGTTGATGT TAATGTTAAC
GGCACGCCCG CCGCAATTTA TACGGCAGCA GGCTTGGTTG GCGAACAACA ATATATGGCT
TTACGTTCGA ATAACACCAA TACTAATACC AATGGCCCGA TCACAGCCCT GAGTTTAGGG
ATGATCACCA CGGTTGTGGT GGTGTTAGTC GGTGGGTTGC TTCAGTTGCA ACGGCGTATC
CGTGGAAAGA GGAATAGCTA G
 
Protein sequence
MNCPRCGTPN TPDRQFCGQC QAPLAVTPSA SNDLPPWLQD LDQQQTVVPA KRGTSSLPPW 
LQEASPPPPV AANAESLPPW LQESAAPIPA PNVAPAPANS EELPPWLRDL QATAPALPAD
EPLPNAPRSE QPLPNWLSDL QSGSTPPPPT PVVNADVPSW LQGSVETTPP APKPSANADV
PSWLQEPALP AAPAPTPVAP SADVPSWLQG SVETAPPAPK PSANADVPSW LQEPALPAAP
APTPVAPSAD VPSWLQEPAL PADPAPTPAA PSADVPSWLQ GSVETTPPAP KPSANADEEP
MPAWLQQLRP SEPAPPPGIA SFVEDNEEPV IARRGGTGNL PDWLKDFDTE PPVVQLSDVD
LDAAGPSDDV PPWLKPATAP LTSPAAPAAS SGVPAWMQAD SAPPAPPAAP AASAGDDVPS
WLRGDLDVVP PAAPAASGDV PSWLQAESAP PAVPAASAGD DVPAWLRGDL DVAPPAAPAA
SGDVPSWLQA ESAPPAAPAT SAGDDVPAWL RGDLDVAPLV APAVSAGDDV PAWMQADSAP
LAPPAASTGD DVPAWLREDL DVAAPAAPAA SGDVPAWMQA DAAPPAPPAA SAGDDVPAWL
RGDLDVAAPP AAPAASAGDD VPAWLRGDLD VAPAAPAASG DVPAWMQADA APPVPPAASA
GDDVPAWLRG DLDVAAPPAA PAASAGDDVP AWLRGDLDVA APPAAPAVSA GDDVPAWLRG
DLDVAAPPAA PAASAGDDVP AWLRGELDVA APAASGDVPA WLQADVAPPA APTASGGDMP
AWLQADAGET VRLDSADASA DVPAWLKADL EAAPPAAPAA SGDIPSWLQA DAGETVRLDS
ADASADVPAW LKADLEAAPP AASAASGGDI PSWLQADAGE LPQVSPTVRL DPEDSSDNVP
AWLKADLETA SPAAPAPIDT PNWLHEDAPT VKLDNQAAGI PAWLQEDLPT VKLDSPAGNI
PAWLQDDVTP IAPAPVKPEP VAPATPSYDP EGANVPAWLR DDLDIEIPAS KPTVAPIIPS
STSPSWLLDD EPTQATQDDT LLGSVDLPAW LRQTVKLEEP SLVVEQEAAS AVAESADWLR
VLGEPEPAIA ATSPTTRRLI DSEPPALLSR TPERVSAMHL LQDLVTKPLP EPVEAPPVAL
APWWQRIGTQ RIVASLMIIS LLVGLLVPNL LAINTQALTI GGNTSDLYNY VETLNPESRV
LIAYEGDLRH NAELGPLEHA ISQHLIERKV PMLLLSTDTE GSLLASQRAA QFPVIDANTG
YTGEGAEYLN LGWVKGNELG IAQLGSNLRG AIANLLLNRE GADVNLLRIM SKLNSNNQLD
RPRITTTKDL DLLIVVADEP GDVQRWMEQF WAREPALPVA ILTTNEVLPQ IQPYVDVNVN
GTPAAIYTAA GLVGEQQYMA LRSNNTNTNT NGPITALSLG MITTVVVVLV GGLLQLQRRI
RGKRNS
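
Both sequences above are displayed in space-separated blocks of 10 characters, so the spacing has to be removed before any analysis. The following is a minimal sketch of that cleanup, together with a GC fraction and a codon-by-codon translation. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (alternative start codons are not handled here), and the helper names `clean`, `gc_fraction` and `translate` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First displayed line of the gene sequence in this record.
first_line = "ATGAATTGTC CTCGCTGTGG CACGCCCAAT ACTCCTGATC GCCAATTTTG TGGGCAATGT"
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Translating the first displayed line of the gene sequence should reproduce the first 20 residues of the protein sequence listed above. Running the same functions on the full gene sequence gives a way to compare against the listed GC content and protein length.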