Gene Haur_3253 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3253 
Symbol 
ID5735121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4115352 
End bp4116611 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content45% 
IMG OID641280399 
Producthypothetical protein 
Protein accessionYP_001546018 
Protein GI159899771 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000295877 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCAG ACATTATTGC CAATCACTAC GAGGCTGTAG AACAAGCCGG AGTACGCTTC 
AAAAAATTGT ATGATTACCA AGTGCAGATG GAAACAATGA TTCGCCAGAT GTACGAGCAA
CTTCGCAACG ATGGATGGCA AGGTACGGCT GCCCAAGCAT TCTTTGTTGA AATGACCGAT
GTTGTGTTTC CTACCTTGCA ACGGTTGCAA AATACCTTGC TTGTGAGCAG TGAGTTGACC
AAACAAATCA ATCAGATCTT CCGTCAAGCT GAAGAAGCGG CTGCAAAATG TATTACGTTT
GATAGTGGTT CAATTGGTGG CATGCTGGCG GATGCTGGTC TTGCTGCTGG TCATGGGTTG
GCGGGTGTGA GCATTGGTGC TGCGCTATTC GATGATACGT CAAAGCCGGA TTTACGCAAA
ACAATCTTCA ACGACGATTA CATGAATCGT TTGGTTGGCT TAAAAATGCA AGGTATGGAT
GTACCTGAAC TGAATAAAGC TATGGAAACC ATTATTAATG ACAAAGCTTC AGAAGCCGAT
GTTCAAGCGG CTTTGGTCAA AATTGCTAAA TTGCGCGATG TGCCACTGGA GAAAATTCAG
GCCGACCACA AGAGGTTTGT CGAATTACGG AAGCGAGCCA CCGAAAAAGG TGACGTTGAT
CAACTTGATC AAGATCGACA TCCTGACTTT TTAGGCAGTA CCACAAGTTT ACGGTTTGGC
AAAGTTCTTG GCGATGTCTT TGGTATCGAT CCAGTTTTTG GTTCATTGCT CAGTCCTACA
GGCGGCTTAG TGGGTTACGG TAATATTGCG ATTGATGCTA AGGAACGTCC AGTTGGCTAT
CACGGCATCA TGCACGACGC AGGTGGGTAT CTGCTCAAAC ATCATGAGAT GGGACCAGGC
TATGATTACC TTGGCTTGGA AGGCCGTAAT CCCACTCACC CATTGACTGG TCAAGAGTCA
GGCATTCGTT ATTGGAATCA GAAATTAGCC TATGGATTTA TTGATGGCGA TGGTATTAAT
CTTAATAACC CCGTCCAAAA ATTGGCTGAA TACGCGGTTA TTGATGCTGT AGATCATAGA
TTGGGTCAAA TTGTGGATGT TCATGAAGGG CTTAAAACCG TTAAAAATGC GGCGAACGAT
GGCTGGAATG CCGCCAAAGA TGCGGCGAAC GATGGTTGGC ATGCAACCAA AGATGCGGCA
AGTGATCGAT GGAATGCTGC TTCAAAATCA GCGAAAGATA TATTTTCTTT TCTATTCTAG
 
Protein sequence
MSADIIANHY EAVEQAGVRF KKLYDYQVQM ETMIRQMYEQ LRNDGWQGTA AQAFFVEMTD 
VVFPTLQRLQ NTLLVSSELT KQINQIFRQA EEAAAKCITF DSGSIGGMLA DAGLAAGHGL
AGVSIGAALF DDTSKPDLRK TIFNDDYMNR LVGLKMQGMD VPELNKAMET IINDKASEAD
VQAALVKIAK LRDVPLEKIQ ADHKRFVELR KRATEKGDVD QLDQDRHPDF LGSTTSLRFG
KVLGDVFGID PVFGSLLSPT GGLVGYGNIA IDAKERPVGY HGIMHDAGGY LLKHHEMGPG
YDYLGLEGRN PTHPLTGQES GIRYWNQKLA YGFIDGDGIN LNNPVQKLAE YAVIDAVDHR
LGQIVDVHEG LKTVKNAAND GWNAAKDAAN DGWHATKDAA SDRWNAASKS AKDIFSFLF