Gene Haur_3716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3716 
Symbol 
ID5735580 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4674206 
End bp4675333 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content53% 
IMG OID641280868 
Producthypothetical protein 
Protein accessionYP_001546480 
Protein GI159900233 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000943682 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAATTG TTGAGGGATT AATCAATCTT GGGGTGTTTT TGATCGGGCT GGTGATTGTG 
GCATTCACGC TATTTAGTGC GATTCGCATG GTCGTGTTGC CACGCGGCGA TAATGTTTGG
CTGACGCGCA CCACCTTTCG GCTGGTCTAC GGCTGCTTTT TGGTGCGTTT GCGCCGAATC
AACAACTACG AACAAGCCGA TCGGGTGCTG GCATTTTATG CGCCAGTGGC CTTGCTGATT
ATTCCAGTGG TTTGGATGGT GTTGGCAATT GCCGGCTTTA CCTGTATGTT TTGGGCGGTT
GGCGAGCATC CTTGGAGTAA AGCATTTTTG CTGAGTGGCT CGTCGATGTT GACGCTGGGG
TTTGCCGCAG TTGATTCAAC GGTTGAAACA ATTTTGGCCT TTTTGGCGGC AACGGTTGGC
TTAGGCATTG TGGCTTTGTT AATTGCCTAT CTGCCAACGA TGTACGGGGC TTTTTCCAAG
CGCGAAGAAG CCGTGACCTT GCTGGAAGTG CGGGCAGGCA CGCCGCCATC GGCAATTACC
ATGATTCAGC GCTATCAGCG GATTCATAAT TTGGAGCGTA TGCCTGAGCA ATGGGCAATT
TGGGAGCAAT GGTTTGCCGA ATTAGAGGAG AGCCATACCA CCTTTGCGGC CTTGGCGTTT
TTTCGTTCGC CCCAGCCGTA TCGCTCGTGG ATCACCGCTT CAGCGGCGGT GTTGGATTCC
GCAGCGTTGG CAGTTTCGAC CCTTGATATC CCGCGCCAGC CAGATGCCGA TTTGTGCATT
CGGGCGGGCT ATTTGGCGCT GCGCCGCATC GCCGATTTGT ATATGATTCC CTACAATGCC
ACGCCGCATG CCGATGATCC AATTAGCATC ACGCGGGCTG ATTTTGATCA GGCTTGTGCT
GAATTGATCG CTAGTGGCGT GCCACTCAAA GCTGATCGCG ACCAAGCATG GCGTGATTTT
GCTGGTTGGC GGGTCAATTA CGATCAAGTG TTGATCGAAT TAGCCAAATT GGTAACCGCC
CCGCCAGCCC CTTGGACAGG CGAACGTCCG CTGCCATTAT GTTTCAACAT GCCACGTTTT
GGCCATAAAC GCCGCCAACT AGCCAAAGAG CCAACTGGCT CAATGTAA
 
Protein sequence
MTIVEGLINL GVFLIGLVIV AFTLFSAIRM VVLPRGDNVW LTRTTFRLVY GCFLVRLRRI 
NNYEQADRVL AFYAPVALLI IPVVWMVLAI AGFTCMFWAV GEHPWSKAFL LSGSSMLTLG
FAAVDSTVET ILAFLAATVG LGIVALLIAY LPTMYGAFSK REEAVTLLEV RAGTPPSAIT
MIQRYQRIHN LERMPEQWAI WEQWFAELEE SHTTFAALAF FRSPQPYRSW ITASAAVLDS
AALAVSTLDI PRQPDADLCI RAGYLALRRI ADLYMIPYNA TPHADDPISI TRADFDQACA
ELIASGVPLK ADRDQAWRDF AGWRVNYDQV LIELAKLVTA PPAPWTGERP LPLCFNMPRF
GHKRRQLAKE PTGSM