Gene Haur_3940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3940 
Symbol 
ID5735801 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4936064 
End bp4937608 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content55% 
IMG OID641281091 
Producthypothetical protein 
Protein accessionYP_001546702 
Protein GI159900455 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.405869 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGATGC CGCCGTTGTG GCATCTCTCC CTGAATCGTA GCAGCGTTGC TACCACCAAG 
CTGCATTCCC AGATGCCCAG TTTTGCATCC ACCGCGTACC TCCGGAGGTT TCTGTTGCGA
CACATTGCCC GTTTCTTGGC GCTTGGTTGT CTGCTCGCCG TGAGCATGAC CCCCATCCGA
GCGCAGGCGA ACCCAACCTT GTGGGCTGCT GCTGCATCAA CCGACACCCT CGTTCTTGCT
TCTGGTCAAC TGACCCTCGC CCCGAATGCG GCCCAAGCGG CTGCGGTCTA TGGCTCATAC
GCTCGTTTTG GAATGTTCGA TAGTGCTCCG CAAAACATCG CTCCCGCCAA TCAAGTGCTG
GTTACATGGG GCGCAACTGT GCCCGCTGCC GCTAGCGTGC GCGTCGATGT GCGTGGCTTC
AATGGCCAAC GCTGGAGCGA TTGGACGCTT GATGTGCAAT CGGGCCAAAC GGTAGCTTTT
GCCACCATCG CCCGCCAAAT TCAATATCGT TTGGTGCTAT TGGCCAACGA GGCTGCGCCA
GTCGTTGATT TTGTGCAACT TGCGCCCAAC ACGCTTGCCG AAAGCGATGC CATCAGCATT
ATGGAAGATG AGCCGATTGC TCCAACCTAC CATATTCGGG CTACCCGGAT GGGCTTGGTT
GGCGATCGCA CGGCCAACGG CCATATCATT CAGCCAAACG ATTGGTTTGT TTCATTGCCA
TCGTTCCGCT CACTCTCATC GCGTGGCGGC GGCGAATACA TGGCGCGGCT TTCCTATCGT
GGCAAATCGA TTGTTGTGCC AGTTTGGGAA GTTGGGCCAT GGAACATTCA CGATGATTAT
TGGAATGTTG AGCGCGAGAA ATTTGGCGAT TTGCCTGTCG GCTGGCCCCA AGATCACGCT
GCCTATTTCG ATGGCTACAA TGGTGGCTGG GCTGAAAAAG GCCGCGTGCG ATTCCCCACC
GCTGCTGATG TCGGCGATGG CGCATGGGTC GCCTTGGGCA TTCCATTTAA CGATGAACAA
GAAGAACTTG ATATTACCTT CTTGTGGCTA GGCCGTGATC CTGGCGATAA CCCCGACCCA
ATGCCAGTTG GCAGTGTTAC GCCTGAGCCA GCCCCAATTG AAGAATTACC AGCGGGCACG
ATTCAGGTCG ATAATCAAGG CGAACAATTC AGCCGCTCCG ATGTGGCATG GTTTGAATTT
TCGTGTGGTA AAAATCGCCA TTCGTTCTGG ACCTTCTCAA CCAACAAGCC TGAAGAAGCA
GTCAATAATG CGCGTTGGAC AACTCCGCTT GAGGCTGGTG ACTATAGCGT GACCGTGTTT
GTGCCCTACT GCCCCAATGG CAAGAGCGAT ACAACTTCAG CACGTTATGT TGTGCAACAT
GCCGATGGCG AAACTCAAGT TGTCGTCAAT CAAGCGGAAC ATGCTGGCAA CTGGGTTGAG
CTAGGCCGCT ATCGCTTTGA TGGTACTGGC ACAGTTAGCC TCAGCGATTT GGCCGACGAC
CGCATGAAAG CCATTTGGTT TGATAGCGTG CGCTGGACAA AATAA
 
Protein sequence
MPMPPLWHLS LNRSSVATTK LHSQMPSFAS TAYLRRFLLR HIARFLALGC LLAVSMTPIR 
AQANPTLWAA AASTDTLVLA SGQLTLAPNA AQAAAVYGSY ARFGMFDSAP QNIAPANQVL
VTWGATVPAA ASVRVDVRGF NGQRWSDWTL DVQSGQTVAF ATIARQIQYR LVLLANEAAP
VVDFVQLAPN TLAESDAISI MEDEPIAPTY HIRATRMGLV GDRTANGHII QPNDWFVSLP
SFRSLSSRGG GEYMARLSYR GKSIVVPVWE VGPWNIHDDY WNVEREKFGD LPVGWPQDHA
AYFDGYNGGW AEKGRVRFPT AADVGDGAWV ALGIPFNDEQ EELDITFLWL GRDPGDNPDP
MPVGSVTPEP APIEELPAGT IQVDNQGEQF SRSDVAWFEF SCGKNRHSFW TFSTNKPEEA
VNNARWTTPL EAGDYSVTVF VPYCPNGKSD TTSARYVVQH ADGETQVVVN QAEHAGNWVE
LGRYRFDGTG TVSLSDLADD RMKAIWFDSV RWTK