Gene Haur_5295 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5295 
Symbol 
ID5737253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp87701 
End bp90250 
Gene Length2550 bp 
Protein Length849 aa 
Translation table11 
GC content60% 
IMG OID641282459 
Producthypothetical protein 
Protein accessionYP_001548050 
Protein GI159901805 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00160972 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCATCC CCGAATCGGC CACGATCCCC ATCAGCCTGC GATCCTGGCT GATGCGCGAT 
GAGTCCATCT ATCTCTACGC TCACCGCATC CCGACCGATC CACCAACGGC GTGGCTCTGG
ACGCTGATTG CCCAGCTGAT CGTCCAGCAT TTGCCGGATC ACCTGCCAAC CCTGCCTCCC
GCTGATGCGC CGTATCCAAC CGTGTGGAAA ACATGGGTTG CCCTGCTGAA CCGCCTTGCA
AGCACCACCA AGCCCATCAC CATCGTCTTG GAGGGGATGG ATCACTGGGA ACGGCCCTGG
GAACAGTTGC CCGTGGTACT TCCGGCGGGA GTCGTGATCA TCATGCCGAT CCCGCTTGCG
CCGAACGCGA TGCCAGAATG GCCTGTTAAT ATGGTCATCA TGCCGATCCC ATCCGACCCT
ATCCCCACCG CAATTAGGGA TGGAATCGAT CACCTTGAAG CGCATGCACG CACCGATCCA
AGCCAATGGA AGACCATCCT CTGCCCGCTG CTCGCCTGCC TGCACGGAGC CTACGCCCCG
CTCACGGTGA CTGCCATCGC CACCATTTTC GCCGTGCCCG TCGAAACCGT GGAAGCCGCC
GTGGCTTGGA TTCCGTCGAT CCTGGCCGTT ACGCTGGATG GCGTGCTTTG CCTCATGCCA
GGATTGGCGG AGATTCCCGC ATGGAATCGG TTGTTCCCCG CGTCCGTCCT CGGCGATGGT
CATCGCCGCC TTGCCGCATG GTGTGGTCAA GCCGAGGCGG TGATCGGCCA AGACACCGTT
GATGATGCCG CGTTTGAACA GCGGCGCTAT GCATCCCATG CCTTTCTGAT GCATTTAACG
GGTGCGCAGC AGGAAAGGCA GATCGGACTT CTTCTTGATG CCGGAGTCTA TGGGCTGGCA
AAAATCCAGG GAGACGGGAC GCGGGATGGG GTAATTGCCG ATCTTATGTA TGGCGTTGTG
CTCCAGACAC CATCACCGTC TATCCCACTC GATGTGGTCC TCGTGCGGGC TTGGCAGTAT
CGGTTGCTGG CATTGCGGCT TCAGACAGCG GTCGAAGAAA CCAATCATGA CCGCTGGCGT
GGCATGGTCG AGCTTGGTTT CGGGACAGAA GCCATGCACC AGATCGCTCA GCTCCCGCAT
CAGTCAGAGC GCATCGCCGC GTGGAGTGCG GTACTCGCCG TCGGGAATAC GGTGCAACGC
CAAGCGATCT ATGCCCAGAT CGTAGCAGCC ATTCCACACG GCCCGATTGA TCCGACCGTC
CTCAGGTCGA TCCTCACCAG CGCGATGGAG CATGGTGATT GGGCCATCCT CGAACCGTTG
GTATCGGCCT ATCCCAACGC GTGGCAACGG GGCATGATGA TGCTCACGAT TGCCCAGGCG
GCAGCCCGTG CCGATCAGCA CCGTATTGCC CGACAGTGGC TGCTGCGCGT TCAAACCGCC
ATTCCGCTCG ATCTCCTCCC CGTCCAACAG CTGGCGATGG TGGGACTGGT CGCCGAAACC
GCTGTCGTGA TCGGTGACCG TGAGACGGCA CGACAGATCG CTGATCGGAT GGACGATCAG
CGATTCCGCA GTCATATCCA GCGGCGTATC ATCGATGCAG CCATTGCTGC GGGTGCCATC
GCCGAAGCCC AAACCATCGT GCAGACGATC CCCAATCCCG CAAGTCGTGC AGCAGGACTC
CAAGCCCTTG CCGTAGATGC AGCGCGGCGC GGCGCGTGGG ACACGGCGCA CCAGTCCGTA
GCAGCAATCG TTGATCATCC AATTGCCGAT GCCACCCGCG ATGCGCTGGT GCAAGCCGCT
ATTGACCAAG GACGGTTGGA TGATGCGGCG TTGCTTGCCC AATCGATCGA TGAAGCCCCC
AGCAAAGTCC ACGCCTTGAC GGCCCTTGGT ACGGCCTATG CCCAGCGTGG TGATTGGATG
CAGGCTGATA CCATCCTTCG GATACTCGAT ACGACCGATG GTGACCAGGA ATCGGCGCTG
CTGGCGATGG TTGTTGCTGC GGCAACAACC GACAATGGAG CCAAAGTTCA CGCGTTGTTG
CCGTTGATTT CCCATGTGCC TGGCCTCTAT CAGGCAATCG TGGCCGCGCA TGAACAGGGA
CATCACGCGC TTCGTGATCA GGTGATACAG TATATGCTGC AGTATGCACA GACCTTCATC
GATGATCAGG AGCAGGCCAG CTTCCTCCAC AACACGCTGT TTCTGTTGGT CCATGTGCGA
GTCTATCGCT TCATGCCACC CTTCCTCGCA GCGATCCGTG ATCCACACGC ACGGGCATGG
ATGGGAACCG ATCTGACCAT GCGGGCCATG GGTGTTCCAG CGCCGCCGCT CCGACAAGGA
CGCGGTTCCA TCGATGGCAT GGATGATACG ATGGCGTTGG TGAATCCCAT CCTCGCCGCA
TGGATCGACG CACCAACGCC ATCGGCACTG TGGGCACGTC TATCCATGGT GCACCCACTC
TTGGCAGCCT ATCCCGATCT TGGGGCCGCG CTCTTGGCAA CCATTCCCTG GGTTGATGCG
ATGGTCGCCC GCCTGACTCG CTACGGATAA
 
Protein sequence
MLIPESATIP ISLRSWLMRD ESIYLYAHRI PTDPPTAWLW TLIAQLIVQH LPDHLPTLPP 
ADAPYPTVWK TWVALLNRLA STTKPITIVL EGMDHWERPW EQLPVVLPAG VVIIMPIPLA
PNAMPEWPVN MVIMPIPSDP IPTAIRDGID HLEAHARTDP SQWKTILCPL LACLHGAYAP
LTVTAIATIF AVPVETVEAA VAWIPSILAV TLDGVLCLMP GLAEIPAWNR LFPASVLGDG
HRRLAAWCGQ AEAVIGQDTV DDAAFEQRRY ASHAFLMHLT GAQQERQIGL LLDAGVYGLA
KIQGDGTRDG VIADLMYGVV LQTPSPSIPL DVVLVRAWQY RLLALRLQTA VEETNHDRWR
GMVELGFGTE AMHQIAQLPH QSERIAAWSA VLAVGNTVQR QAIYAQIVAA IPHGPIDPTV
LRSILTSAME HGDWAILEPL VSAYPNAWQR GMMMLTIAQA AARADQHRIA RQWLLRVQTA
IPLDLLPVQQ LAMVGLVAET AVVIGDRETA RQIADRMDDQ RFRSHIQRRI IDAAIAAGAI
AEAQTIVQTI PNPASRAAGL QALAVDAARR GAWDTAHQSV AAIVDHPIAD ATRDALVQAA
IDQGRLDDAA LLAQSIDEAP SKVHALTALG TAYAQRGDWM QADTILRILD TTDGDQESAL
LAMVVAAATT DNGAKVHALL PLISHVPGLY QAIVAAHEQG HHALRDQVIQ YMLQYAQTFI
DDQEQASFLH NTLFLLVHVR VYRFMPPFLA AIRDPHARAW MGTDLTMRAM GVPAPPLRQG
RGSIDGMDDT MALVNPILAA WIDAPTPSAL WARLSMVHPL LAAYPDLGAA LLATIPWVDA
MVARLTRYG