Gene Haur_2145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2145 
Symbol 
ID5734047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2702012 
End bp2703577 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content50% 
IMG OID641279286 
Producthypothetical protein 
Protein accessionYP_001544913 
Protein GI159898666 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTTAA TTGTTGATAT TTTAATTGAT GATCTGCGGG CCTTGATTCG CGACCTTGGC 
CAAAATGGTG GCCTGATGAG TCCATCAGTC TATGACACAT CCCAAGCGTT GCGGCTCTAT
CCAACGCCCA GCGAAGAGCA TGTTTGGCCA GCAGTCAACT GGCTGATTAG CCAACAACAG
TCGGATGGTG GCTGGGGTAA TCCATCGATG CCGCTCAGTC GAGCAGTGCC AACCCTTGCG
GCAATTTTAG CCCTACGCCG CCACTGTCAG CGTCGTTCAA CCTTCGATGG ATTGCTTGAG
GCCAAACGTT TTCTGCGCCG CCAACTTGAA TATTGGGAGA AACCGCTGCC CGATAACCTG
CCAGTTGGAA TGGAACTCCT GCTCCCTTAC ATGCTTGAAG AGGCCTATCG CGAAGAGCAT
CAAGATGATA TCGACGATGT GCCAATTAAG CTCCGCCTTA ATATCCCCCT TGCACCCTAT
CGCGAGTTGA TCGCACTCGG CGAACATAAA CGCTCATTGA TTCAACAAAA AAAGCCCCGT
GCAGGCACAG CCCCAGTTTA TTCATGGGAA GCATGGGCTA GTCATGCTGA TCCAGAATTG
ATCGATGGCT CAGGTGGCAT TGGTCATAGC CCCGCTGCCA CCGCTGCATG GTTATTTGCT
GCCAATCATA ATCCAAATCT ACGCAACGAA ATCGCTGGCG CAGAAAACTA CCTGCGCCAA
GCGTCGCTGG CCACCTCGGA AAGTGCTCCA TGCATTATGC CAACCGCATG GCCAATCCCA
CGCTTCGAAC AATCGTTCAG CCTATATGCT TTGGTCACTG GCGGAATTCT CGATTTCCCC
AGTATTCAGG ATGTGCTCAA ACCACAAATT GCCGATTTAC ATCAAGCACT CAAGCCGCGC
GGGATTGGCT TTAGCGACGA TTTTATGCCC GATGGCGATG ATACCGCCGC CGCCGTGGCA
GTATTAATCG CAGCAGGCTA TCCAGTCGAT CTCGCGATAT TAAATCAATT TGAGCGTGAA
CCCTACTTCG TAGCCTATCA TGGTGAGTTA CAGCCTTCAA TTTCGCTGAC AGCTCGCGCC
GTGCACGCAC TCGATTTAGC CGGAGTTGAT ATTTCACGCT GGTGGAAGAT TTTTATTGAT
GCTCAAAAAC TTGATGGCAG TTGGAGCGGC GATAAATGGA ATACTTCGTG GCTCTACACG
ACCTGCCATG TACTGATTGC GCTCAAAAAC TCGCCCTACA AAACCGCCAT GAAAGAAGCC
GTCGCTGCAT TACAAGTCCA TCAACATCCT GATGGTGGCT GGGGCATCAT CAATCGATCA
ACCACGGTTG AAACGGCCTA TGCGGTGCTG GCATTGCAAA ACTTACGTGA AGCTGGCCTC
TTAGATGACG ACGACATCCA CATGCTCCAA CGTGGTTATA ATTGGCTCTG TATTCATTAT
CGTCCATTTC GGATGAAAGA GTATCAATGT TGGCTCAATA AAGAAATTTA TTGTCCCCAA
CGGATTGATC GCGCTTATGA GTTAAGTGCC ATGTTAGCAG TCACTCTAGG AGAATTAAAA
TTATGA
 
Protein sequence
MSLIVDILID DLRALIRDLG QNGGLMSPSV YDTSQALRLY PTPSEEHVWP AVNWLISQQQ 
SDGGWGNPSM PLSRAVPTLA AILALRRHCQ RRSTFDGLLE AKRFLRRQLE YWEKPLPDNL
PVGMELLLPY MLEEAYREEH QDDIDDVPIK LRLNIPLAPY RELIALGEHK RSLIQQKKPR
AGTAPVYSWE AWASHADPEL IDGSGGIGHS PAATAAWLFA ANHNPNLRNE IAGAENYLRQ
ASLATSESAP CIMPTAWPIP RFEQSFSLYA LVTGGILDFP SIQDVLKPQI ADLHQALKPR
GIGFSDDFMP DGDDTAAAVA VLIAAGYPVD LAILNQFERE PYFVAYHGEL QPSISLTARA
VHALDLAGVD ISRWWKIFID AQKLDGSWSG DKWNTSWLYT TCHVLIALKN SPYKTAMKEA
VAALQVHQHP DGGWGIINRS TTVETAYAVL ALQNLREAGL LDDDDIHMLQ RGYNWLCIHY
RPFRMKEYQC WLNKEIYCPQ RIDRAYELSA MLAVTLGELK L