Gene Haur_4160 details

Gene Information

Locus tag: Haur_4160
Symbol:
ID: 5736021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5306860
End bp: 5308410
Gene Length: 1551 bp
Protein Length: 516 aa
Translation table: 11
GC content: 53%
IMG OID: 641281314
Product: hypothetical protein
Protein accession: YP_001546920
Protein GI: 159900673
COG category: [S] Function unknown
COG ID: [COG1700] Uncharacterized conserved protein
TIGRFAM ID:
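
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that does not appear in the protein. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5306860
end_bp = 5308410

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, codon_count, remainder, protein_length)
```

Under these assumptions the result is 1551 bp and 516 aa, matching the Gene Length and Protein Length fields.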


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00251494
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCAGAGT TAATTCCACT GACAATTAAC GGTATCGATA GCCAAGCAAC TGTGCATGTC 
ACTGCCAGCG CTTGGAATGA GTGGGCCGTG GTCAACATGG CTTGCCCAGC CCAAACCAAT
GTGCGCAATG CGCAGCTTTC GATTGGTGCT GATAATTTGG GTGTGCCCCA AGTCAGCCCA
TTTGATCCAA CTTGGCGTTG GTCGTGGTTG CCACGTGGCC AGGCGGGCAC AGTTTATGGC
CGGTTGCAGA TTGAATGGGC TGATGGCGAA ATTCAGCAGC AGCAGTTTCA ATTTGAGCTT
CAGCCACATT TGCTTGATCG TGAATTGTGG CGAGCATTGC TGAACGATCT CAGCTCGTTG
GCTCGTTCGC TCGCCTTGCG AATTGCCAGC CCCAGCTTTG CCCAAGCGGT GTTAGTGCCG
TTGCTCCCCG ATGATCCTAG CCCATTTTTA GAAGCGCTGA GCTTGATTAA CCAAAGTAGC
CAGCAAGCGA GCCAGATTGT ACGCAGCTTG CAGCGTCAGG CCAAATCGAC ACTTGAACGC
CAACCGCGAA CAACTGACTT GGGCACGGCT CAGCAATTTA AGCTCGATCA ATTGGCTCAG
CCCAGCGAGC GTTATCACGT GCTCGAACCG TATGGCCTCG TGCCAGAGCA GGTGCAGGCT
GAGCATGCCC AAGCTTCATT CGATCTGCCA GAGCATCGCT GGTTGATTGG CTTGATTCAG
CAAATTGAGC GGCGTTTACG CAATTTACGT CGCCTTGCTC GTGAACAACG CCAGCTTGAT
TTAACCAGTA TTCAAGCAAC AATTGAGCAA CGACTGACGG CTTTGCGCCA ATTACGCCAA
GCAGCGCCCT TGGCGGGTTT AAAAGCTCGC CAGCAGCCAG TCCAAAGCCA ACTGATCAAC
CGTGATGCGC GTTACCGACC GATTCGCCAG TTGGCGCGAA GTTTGCATGA ACAGCCGTTG
CTCACCTTGG AAGTTGGCAG TTTGGCCTTG CCCTTGGCCG ATGTGCCAAC CCTCTATGAG
CAATGGTGTG CGCTAGCTGT CGCCCAGGTT TTAGCCGAAT TAGGCCAGGT TGAAGCCCAA
CATCTGCTGA TCGATAACCC TCAGCGTGAA CGTTGGGTGC TTGAATTAAA TTCCGCCACA
CCGTTGTTGA GCGTGCGGAT TGGCTCACAG CTTTGGCATT TGCGCTACCA AGCGCGATTT
AGCGCCCAGC CCGATTCTGA TGGTTTTTAT AGCCTTGATC GATATTTGCG CATCCCCGAT
TTGGTGTTAC AAACTGCCAC GGCCAATGCC AAGCAGGTGT TGGTGCTTGA TGCCAAATAT
CGCCGAGCGC CTGACCAGCG GGTTCCGCAA AGTGCGCTTG ATGATGTCTA TGCCTATCGT
GGCAGCTTGG GCTACAATGG TCAGCCATGT GTGCTAGCCG CCGCAATTCT GTACCCACAA
GCAAACACAC TTGAGGAATT TGGCTCGATT GCGGCAATTG GCCTCATTCC CAATCAGCTT
AATCAGCTAA AAACCTGGTT GGAGCGTTGG CTCAACCAGC TTGATCAATA A
 
Protein sequence
MAELIPLTIN GIDSQATVHV TASAWNEWAV VNMACPAQTN VRNAQLSIGA DNLGVPQVSP 
FDPTWRWSWL PRGQAGTVYG RLQIEWADGE IQQQQFQFEL QPHLLDRELW RALLNDLSSL
ARSLALRIAS PSFAQAVLVP LLPDDPSPFL EALSLINQSS QQASQIVRSL QRQAKSTLER
QPRTTDLGTA QQFKLDQLAQ PSERYHVLEP YGLVPEQVQA EHAQASFDLP EHRWLIGLIQ
QIERRLRNLR RLAREQRQLD LTSIQATIEQ RLTALRQLRQ AAPLAGLKAR QQPVQSQLIN
RDARYRPIRQ LARSLHEQPL LTLEVGSLAL PLADVPTLYE QWCALAVAQV LAELGQVEAQ
HLLIDNPQRE RWVLELNSAT PLLSVRIGSQ LWHLRYQARF SAQPDSDGFY SLDRYLRIPD
LVLQTATANA KQVLVLDAKY RRAPDQRVPQ SALDDVYAYR GSLGYNGQPC VLAAAILYPQ
ANTLEEFGSI AAIGLIPNQL NQLKTWLERW LNQLDQ
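
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a minimal sketch, not the database's own pipeline. It assumes NCBI translation table 11, as listed in the record, whose amino acid assignments match the standard code and which accepts GTG among its start codons. It also assumes an initiator codon is always read as M. The function names `clean`, `translate`, and `gc_content` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# NCBI codon ordering: bases cycle through T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # Assumption: an initiator codon in first position is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence, copied with their display spacing.
first_block = "GTGGCAGAGT TAATTCCACT GACAATTAAC"
print(translate(first_block))
```

Applied to the first 30 bases of the gene sequence, `translate` yields MAELIPLTIN, the first ten residues of the protein sequence. The leading GTG would normally encode valine, which is why the start codon needs special handling. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.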