Gene Haur_2159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2159 
Symbol 
ID5734032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2722500 
End bp2723801 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content49% 
IMG OID641279300 
Producthypothetical protein 
Protein accessionYP_001544927 
Protein GI159898680 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATAC CGACAACACA ACAATGGACA GCCGACGTTG TACATACGCT GTTAAACTAC 
CCGCAACGGC TGCTGGAGCA TCCTACTTGG GCGCTGGTTA TCGCCGAATT TGGCGGCATT
CGGCAACTGC GTCAACACCT CTTAACTTAT CCATTTAAAG CCAAAGAATT GCGTTTGCTC
AAGGTGCTGC TCGACTATCC CGATGCCGCA GTTGAATATT ATTGCGACCT GTTGGCAATG
CATCCGGCTA CCTTTCATCG CCAACACAAG GCACTCTGCC AACGCTTGAG TGGTTTATTG
CCTGTGCCAC GTGCCGATGA TCAACCTGCC CCCGAATTGG CGCAATTGCC CTATATCAAG
CCTAAAACCA GCTTTATTGG CCGAGCGCTC GATCTTGAGC GAATTCAATT GCTGTTTGAT
CAAGGTTGTC ATTGGATTAG CTTGGTTGGC GCGGCTGGAA CGGGGAAAAC GCGCTTGGCC
TTGGAAATGA GCCAACGGGT TAGCTCAATG TTTGGCGATG GGATTTGCCT ATTGCAGCTG
AATGCTGGCG TTGAGCTAGC AACGCTGGCT GAATACTGTT TAAGCCAACT TGGGCTTGAG
CCGTTATGCG ATGATCCACG CCAGCGGTTT CAAGCCTATT TTGGTTCACG CCAAATCTTG
CTGATTCTTG ATAACCTGGA TCAGCCAGAG CTTGCAACTT GGTTTGAGGA CACCTTACAA
GCTGCGCCGT TTGTGCGGGT TATATCCACT GGTTGCCAGC GCTTAAATGT GCCGAATGAA
TGCTTACATC ATGTTGAGCC GCTTAACTAT CCACAGCATG ATGCTCAACC TACCTCACTT
GCTGAGAATC CTGCCCTGCA ACTCTTGCTT GAACGATTAA CTCCATTTCA GCCAATTGAT
CTGACCAAGC TAGAACAGCG CAGAATGCTC ATCCAGATTT GTCAGCTGCT TGACGGAAAG
CCGCTGGCCT TAGAGTTAGC TGCTGGTTTA GCTGTAACCC ATGATTTAGC GACGCTTGTG
GCCCAACTTC AACTGATCGA TGCACTTAAT GCTGCGTCTG AAGCTCTAGG ATTGCTCATT
GCACTGAGCC ATGCTGCGCT CCAACCAACA ACGCAACAGC TTTTGGCGCA ATTGTTGAAG
CTGGCGCAGC ATGCATGGCG AACTGAACTT TATGCTTCAA ATGAAGTTAA GCCTAGTGAG
ATTGCTAGCG GTTTACAGGA AGCCCAAATT AAACATTTCT TGATTGATTT AGGGCATTGG
TATGCAATTC CAGGGAGTAT CCAACGCTTT ATCGCTGGTT AG
 
Protein sequence
MTIPTTQQWT ADVVHTLLNY PQRLLEHPTW ALVIAEFGGI RQLRQHLLTY PFKAKELRLL 
KVLLDYPDAA VEYYCDLLAM HPATFHRQHK ALCQRLSGLL PVPRADDQPA PELAQLPYIK
PKTSFIGRAL DLERIQLLFD QGCHWISLVG AAGTGKTRLA LEMSQRVSSM FGDGICLLQL
NAGVELATLA EYCLSQLGLE PLCDDPRQRF QAYFGSRQIL LILDNLDQPE LATWFEDTLQ
AAPFVRVIST GCQRLNVPNE CLHHVEPLNY PQHDAQPTSL AENPALQLLL ERLTPFQPID
LTKLEQRRML IQICQLLDGK PLALELAAGL AVTHDLATLV AQLQLIDALN AASEALGLLI
ALSHAALQPT TQQLLAQLLK LAQHAWRTEL YASNEVKPSE IASGLQEAQI KHFLIDLGHW
YAIPGSIQRF IAG