Gene Haur_2131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2131 
Symbol 
ID5734019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2673973 
End bp2676378 
Gene Length2406 bp 
Protein Length801 aa 
Translation table11 
GC content45% 
IMG OID641279272 
Producthypothetical protein 
Protein accessionYP_001544899 
Protein GI159898652 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACTA TTGAACAGCA GCTTGAATTG TTGCCGCGCT TGTGGCGCTA TAGCCTGCTG 
CGCACCAGCC TCACGGCCCA TGCCGATCGT TGGCCTGATG AACTCTATGT TATGTTGGCA
ATCATTGGAC GTTTACCAGA GGCCTTGGCA CAGATTAATG TTTGCGCTTA TCCAGAACGT
CAAGTGCTTG CTTGGGCGCG AGTGATTAAA TACGCCGAGC CGCAGGTTCA ATTACAGCTT
TTGCAGCGCA TCGAGCATTC GATTCGTTCT ATCCGTGATC CTGAAGATCA GACCTTTGCT
TTGTCGTGTC TTGGCCTGGC CTATGCTGAG GCTGGTATTC CTAATGCGAC CTATCCAATC
TATCAAGTTA TTGATCGGCC TAATCTAACG GTTCAAGGGT TGTTGCTGCA GGCAAATAGG
TTGGCTAGCC AAGGTTTGTC TGACCAAGCG TATTTACTTT TTGATGAACT ATTCATGACG
ATTTTCGCCA TGCCTCAGTA CGAGCAATTG TACCAATTAA TGTTGCTTGT TCAGAGCGCC
AAGCGTGCTG GGTATAATTC GCTTTGTGAG CGGATTATTC AGAGCCTCTA TGTTCCGCAC
GAAGCCCCTA CGTTCAACTC GGCAATTCAG GAGCTTGCTA AAGCCTATGC GGCATATGGT
GATTTTGCCG TTGCCCATCA GGCGATTCAA TTGATTAAAC AGCCGCGCAG TTTTATCCAT
GCGGCCCGGC AAGTGGCGGT GATTGCCTGT GAAAAACAAA TCCATACCCA TACAGCAAGT
TTGCTACAAG CAGCCCATGA ACGTGTTAAG CAGCTTGAGG ATATTGATGA ACACATTTAT
CTGTTGGGAC AACTAGCCAT CCCTGCACGG CAAGCTGGCT TAATCGAGCT GGCCCAAACC
TTGATGGATG AAGCTTTTCA TAAACTAATT GGTGTGCAGC ACCAATATCC GCCTGTAGCT
ACTAGACTGC TTATTCAGAG TTATCAATCT CAACATGCCT TGGCTGATGC CTTAGCGATC
ATACCATTCC TTGATAATCC CCAAGCCCAT GATTACGTGC TTGGTAACAT TATTGAGTGC
TATTTGAATG ATAATGATCT TACGAATGCT CATCTGCTGT TGAAATTATT TAAACCCCAT
GAACAGACAT ATGTAGCTGC CGCTAGTAAC CTATTGATTA AGGCTGGTGC GCAGGGATTA
ATCGAACTCG TTTGGCGGCT TTATTGGGAT GTCATGGCTA TATCTAAAGC GATTAATGAT
CATGTTAATC ATCGTAATTA TTTTGTCTAC GTTGCCTGTA ATCTTGCGAC TGAGGCGAGC
ACGCATGGGT TAACCGTGCT AACGCCGCGA TTGTATAGCG AGGCGATTCA GGCATGTACC
ACAGTTGATC ATGGATATAC TCGGCTGCGG TATCTCAAGG ATTTAGTGCT TGCTCAGATG
AAACATGGTT TGGTTGCGAG TTTCCCCAAT TTGTTAGCTA GTTTACGCCT AGGAGCTACC
CAATTAGAGA TTAATACTGC ATTGAATGAA TTTCTTTGCC CAATCGCGGT GGTTTATGCT
GAACAGGGGA ATTATTCGGC ATTTGATGAT TGGTTCAATT ATGCCCATAC CCAATTGAAA
AATAGCACCC AAACTGATCA TAAAGCGCTT GTGTCGGGCT ATCGAACGCT GATTAAGACC
TATCGTACAT CTGCTTCTGA TTGGATGAAT TCAGCATTTT TAGCAGCGGT GTTGCCAAGG
TTGCAGGTGA TCGCCAACAC AACACATTTG GCTAGTGCTA AAAATCTATT GATAAACATT
TATGCGGATT ATGCGAGTGA AGGGCATCCA GCCTTTCTCG CTCAAGCATA TGAGATGGCG
ATTACAATTG AACCGATAGC TGATCGGCTC AATGCGCTTA AATCACTTGC CAAGGTCTAT
GCGAAAGTAA ATGATGGGCC GCATTTACGG GCAATTATTG CTGAGATGAT TGAGCTTGAG
CTTGATGATT TAGAGTTTGA GTCGATTGCC TTGGTCTGCG CTAAACAGGG AGATTTTGCC
TATGCCCAAG AATTGCTGGC ACGCCAAGAG CCAGCCCCAT GGAAAGATGA AGTTTTATGG
TATTTGATTG CCAAGCTGAT TCAAACCAAT CAAGTGGCTA CTGCATGTCA GTTAATTCCA
AGCCTAAGTG AAGGCTACAA ACAAGAACGC GAGTTTCAAA AAATCATTAC CTACTATCTT
GAACGTCAAC AATTGGCTGA AATTGGGCAA ATTGTTCAAG ATGTTTGGCG TAACTGTATG
AGCGATACTG AATTATGGCA ATTAAGTACA ATCATTGTGC CGTTGATTCC CCACTACCCA
TGGCTTGGCA TTGCCGTGCT TGATAGCGTG CCATGGGTTG AACAGCAGTT AGCTCGCTTG
AAGTAA
 
Protein sequence
MDTIEQQLEL LPRLWRYSLL RTSLTAHADR WPDELYVMLA IIGRLPEALA QINVCAYPER 
QVLAWARVIK YAEPQVQLQL LQRIEHSIRS IRDPEDQTFA LSCLGLAYAE AGIPNATYPI
YQVIDRPNLT VQGLLLQANR LASQGLSDQA YLLFDELFMT IFAMPQYEQL YQLMLLVQSA
KRAGYNSLCE RIIQSLYVPH EAPTFNSAIQ ELAKAYAAYG DFAVAHQAIQ LIKQPRSFIH
AARQVAVIAC EKQIHTHTAS LLQAAHERVK QLEDIDEHIY LLGQLAIPAR QAGLIELAQT
LMDEAFHKLI GVQHQYPPVA TRLLIQSYQS QHALADALAI IPFLDNPQAH DYVLGNIIEC
YLNDNDLTNA HLLLKLFKPH EQTYVAAASN LLIKAGAQGL IELVWRLYWD VMAISKAIND
HVNHRNYFVY VACNLATEAS THGLTVLTPR LYSEAIQACT TVDHGYTRLR YLKDLVLAQM
KHGLVASFPN LLASLRLGAT QLEINTALNE FLCPIAVVYA EQGNYSAFDD WFNYAHTQLK
NSTQTDHKAL VSGYRTLIKT YRTSASDWMN SAFLAAVLPR LQVIANTTHL ASAKNLLINI
YADYASEGHP AFLAQAYEMA ITIEPIADRL NALKSLAKVY AKVNDGPHLR AIIAEMIELE
LDDLEFESIA LVCAKQGDFA YAQELLARQE PAPWKDEVLW YLIAKLIQTN QVATACQLIP
SLSEGYKQER EFQKIITYYL ERQQLAEIGQ IVQDVWRNCM SDTELWQLST IIVPLIPHYP
WLGIAVLDSV PWVEQQLARL K