Gene Haur_5009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5009 
Symbol 
ID5736968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp15565 
End bp16749 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content57% 
IMG OID641282176 
Producthypothetical protein 
Protein accessionYP_001547767 
Protein GI159901521 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000976322 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCTCT TAGAACGCAA ACGTGCAACC GTCTATGACC ATGTGCGGGT GAACCAACCC 
CTGCGTGATC TCCACTACCA ACTGCCTGCG TGGATCACGA TTGCTGACCA ACCCTACCTT
GCAGTGGGCG ATGACAATGG TAATGGGGCC AAAAAGATTG CGGTGTTGGA TGCCAAATCG
CGGTTGATTA CGACCCGCAC GCCCACCGCC TATAAATTGG CCAAGGCCAT TCGGGCGGGT
CAAGGGGTGA TCACCTATCG GGTCAATGGC GGCGATAGTT TTTGGATTGG CGATGATGCC
TTACGCTTTG ATGGCGATGC GTTGCCCATT GGTGGCACCA GCCAACGCCT CTCCGATACC
CGCCAGCGCT CGTTTAATGC GGCCTGTATG GTCGAAACCC TGATCAAAGC GCGGTATAAG
CCGGGTGTCT ACCCCTTAGC CGTGGGCTTC GCCATCCCGA ATGAAGAAAT TGAGTCGCGG
GACAATGACA AAATGGGGGT GAATCCCGAG ACCCGCACGG CCCTCAAGAC CCATTTGAAT
GGGCAAACCT TTGTTGTGGA GCGCACCGAT GCCTTGGGCG TGGTAACCAA CTGGACGCTG
CGCTATGAAA AGATCATCCC GCAAGCCCAG TCGATCGGGA CGTTGTATGC GTGGTCACGC
ACCGTTGATG GCTCGTTAGA GGCCGACGGG ATTCGTCGCG TCTCGATTGT CGATATCGGT
GGAGGTGATA CCCAACTGAC CGAAGTGGAA CTGAATCCCT ACCGCATGAG TGCCGAACGC
TTGGGTGCGG GCACCATCAG TATTGCCCGG GAGTTGGCGG CGAAGTTTCA TCGGTTGCGG
TTGAGTGATG CCCAAGCCCA ATATGCGCTC GAAACGCAGT TGTTAGAGGA GTCGGGACGC
GAATTTCCGA TTGAAAGCGA AGTCAATGCG GCGATTCAAA GTGCCGGACA AGACTTAGTT
GGCCGGATGC TGAAGGTACT CCAGCAGCCG AGCGCCTACG TGATCATTAC CGGAGGTGGG
GTGAAATTGC AAGGGTTGCG GCGCTTGATT GAAGAACGGG CCGAGGCATC CGGCAAAACG
GCTCCCCGCA ACTACACGAT CATTGATCCG AGCGTCGCAG ATATCCTGAA TGCGACGGGG
GCACTGCTGG CGGTGGTCTA TGCGGCGGCA GGGAAAGGAG CCTAA
 
Protein sequence
MTLLERKRAT VYDHVRVNQP LRDLHYQLPA WITIADQPYL AVGDDNGNGA KKIAVLDAKS 
RLITTRTPTA YKLAKAIRAG QGVITYRVNG GDSFWIGDDA LRFDGDALPI GGTSQRLSDT
RQRSFNAACM VETLIKARYK PGVYPLAVGF AIPNEEIESR DNDKMGVNPE TRTALKTHLN
GQTFVVERTD ALGVVTNWTL RYEKIIPQAQ SIGTLYAWSR TVDGSLEADG IRRVSIVDIG
GGDTQLTEVE LNPYRMSAER LGAGTISIAR ELAAKFHRLR LSDAQAQYAL ETQLLEESGR
EFPIESEVNA AIQSAGQDLV GRMLKVLQQP SAYVIITGGG VKLQGLRRLI EERAEASGKT
APRNYTIIDP SVADILNATG ALLAVVYAAA GKGA