Gene Haur_3255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3255 
Symbol 
ID5735123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4117233 
End bp4118333 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content50% 
IMG OID641280401 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001546020 
Protein GI159899773 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0515461 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTACG TTAACCCCTC GCCAACGATT TCAAACGTTG TCTGCAATCA ATTATCAATT 
GTGCTCTTGG TGGGCGTAAT GTTGGGAACA TTAACTGCCT GTAGCACTTC AAGTTCTAAT
CCGTTGCCTA CAATTGGCAT TGATGTCGGG CCAGGGGTTA ACTTTCGCGG CGATATTGTG
GCGCTCCAGT TTCTGCCCAA CCAACAGCTT TTGGTGGGGG TTGGCTACGA TGGGGTTTAT
CGTTGGAAGC TTGCTACGAG CACAATTGAA CAAACCCTTG CCGCCAAACA AATTTACTTG
GCCTTTACAG CTTCAGGGCC ATTGGTTGTA AGCACTGATC GTAAAACGCT GACGACTTGG
AATAGCACTG ATGGCCAAAA AATTCTGGCG TGGAATGCCA AACCATTGCA ACTGCCTAGC
CAAACCACTG CCTTTGAAGT CAGTGCCCTC GCAATCACAC CTGATCAGCA GCAAATTATT
GCAGCCTACA ACAAAGGTAG CATGCTCCAA GCTTGGAACG TGGCGACTGG TGCGGCAACA
ACGACCTTTG GTGCTCCGGC CAAAACTGGC TCAATTGTTG AAATTGCGCT CAGCCCTGAT
GGTCAATTAC TGGCCAGCAA CGATTTTAGT GGCGTAGTGC AGATTTGGGA TGTAGTGAGT
GGTCAGCAAT TACATTCATT CAAAGAAGCC AGCCTCAACT ATCAACCAGG CAAATTGGCT
TGGAGCCACG ATGGCAAATG GCTGGCAGCC AGTAGCGGCG ATAAAAACGG CGGCGGAGTC
GCGATTTGGG ATACCAGTTC ATGGTCAATC TATGCCACGC ATCGCAGCAG TGAGCACCAA
TTTGCTGGTT TGGCCTTTCA TCCAACCGCT CCAACGTTGG CGATTGGCAA TAGTAGTGGC
TTGATCGAGT TGTACGATCT GACCAGCAAA CAAGTCAGCA ACAGCCTCAA AGGCCATGCC
GAGCGGGTTA CAACCTTGGC ATGGAATGCT GATGGCAGCC AATTGGCCTC AGGCGGCAAA
GAACCATTTG TCCTGATTTG GGATAGTACA AGCCTGACCG AGCAGCAACG TTTGGTTTTG
CCAGCTACAC CATTACAGTA A
 
Protein sequence
MRYVNPSPTI SNVVCNQLSI VLLVGVMLGT LTACSTSSSN PLPTIGIDVG PGVNFRGDIV 
ALQFLPNQQL LVGVGYDGVY RWKLATSTIE QTLAAKQIYL AFTASGPLVV STDRKTLTTW
NSTDGQKILA WNAKPLQLPS QTTAFEVSAL AITPDQQQII AAYNKGSMLQ AWNVATGAAT
TTFGAPAKTG SIVEIALSPD GQLLASNDFS GVVQIWDVVS GQQLHSFKEA SLNYQPGKLA
WSHDGKWLAA SSGDKNGGGV AIWDTSSWSI YATHRSSEHQ FAGLAFHPTA PTLAIGNSSG
LIELYDLTSK QVSNSLKGHA ERVTTLAWNA DGSQLASGGK EPFVLIWDST SLTEQQRLVL
PATPLQ