Gene Haur_3274 details

Gene Information

Locus tag: Haur_3274
Symbol:
ID: 5735142
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4137775
End bp: 4139538
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 48%
IMG OID: 641280420
Product: hypothetical protein
Protein accession: YP_001546039
Protein GI: 159899792
COG category:
COG ID:
TIGRFAM ID:
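
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein (the variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 4137775
end_bp = 4139538

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1764 587
```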


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000551776
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTCGTA GGTGGCTTAA GCTTGTAGCA CTGTGTGCTA TCTTCTTGCT TGGCAGTATT 
TGGCTGCCCG CAGCTTCGGC AGAAACCTCG GTAGTAACCC TCAACGATCT TGCACTTGGA
ACAACCTCAT CAATTAGTAA CGACACCATG CGAATCGTCC ATAACGGATT AGCCTATTTT
ACCGCCAATG ATCAATCAAA TGGTTGGACA TTATGGCGTA GTGATGGCAC TGCCGCAGGG
ACATTTAGGC TGACCACTGC CCAGCAAACC AACATCATTC CCCAGTATCT GGTTGGTTAT
GGCGCATATG CCTATATTGC AACTTATACT CACACCGATG GTTACTCAAT CTGGAAAACC
GATGGCTCGC CCAATAGCAT TAGCAAAGTT GTCAGCTTCG ATTCAAGCAG CGTAACTACT
GGCTATGTCA AAATTAGCTT GATGCAAGTG GCCCAAAATA CACTCTATTT CTTCTTTGCT
TATGAATCGA TGGTTGAGGT TTGGAAACTT GATCAGGCCT TGCAACCAGT CAGGATTAAA
GCTTTTCGGG CGAGCGCAAT GCAATTCACC GTTTCAAGCA TTGATCGCTT AGTTGAATTC
AATGGAGCGG TTTATTTCTT GGTTTCAACA ACCGATATAC CATTTTGGGG ATATTTCTCA
GATTTATGGC GTACCGATGG CACTCCCGAA GGCACCTATA GCGTTCAGGT GATCGATGGG
CGCAAATATG CTAGAGTTCA AGGGCCAGTC GTTGCCGGCG GTAAGCTGAT TTTCAATTCG
TTGCTTGATG GGGTTGTGGC GAGCAATGGC TACCCCAATG GTAGTGAGCC ATTATTTGAT
ACCATCGAAG ATGGTGATGC CCCAATGCAG CTTGTTAGTG TGAATGGGAT TGGCCTGTTG
ACTCGGTATC ATGGTCATCG CTTGTGGCGA ACTGATGGAA CAGTTGAGGG AACCTATCCA
ATTGATGTTA ATCCACATGG CCCTGATAAT CTCATATTTG GCCCAGTGGT TGGTAACTAT
CTCTACTTTA CTGCCGAACA CCCGAGTTAT GGCCGCGAAT TGTGGCGCAC CAATGGCACG
CTTGCTGGCA CCAGCTTAGT CATCGATGGC ATTGCAGGCC CGACGAGTAG CAATCCCATC
AACTTCGCGA CAGTTGGCAA ACAACTTTAT TTCACTGCCA CGAATAGCCA AGGTGGTGTC
CAACCATGGA AGTTGGATTG TGTTGGAGCC AACCCGCAGA TGATTGGCCC AATTGATTCA
ATTAGTGCCA ATGCCAACCC TGGCATGTAC CTTGAAACTC CCCAAGGCAT CGTGTTTGCT
GCGAATACCC CTGCTCTTGG CCACGAACCA TGGCTGTATC GCGAAAGTGG CAATACGTGG
CTGAAAAGTA ATGCAGTAGT CGCAACTGCG AGCGATCAGG TTGCCGCAAT TCCGGTGACG
ATTGGCAATG ATGGCACGAT TATGCAGCAA ACTCTCGAAC TCACCCTTAT TCTCACTGAT
AGGGTCGAAT ATCTCAGTGA TACAAGTGGC ATTACCCCGA CAATCCAAGC CAATAGTTAT
GCTTGGAAAC TTGATCGGAT TGCAGCAAAT TGTAACGAAC ACAGTTTTGT GGTGTATGTT
AAGTTGCCTA GTTTCTCCTT AAATCAACAC CGTCCATTTA CGCTCCAACT AAGTGGCATG
GCCCCAGGCG ATAGTGCCAA CGACAATCAA GTTAATGGCC AGCTGGTGGT TGGTACTCCC
CTCTTCTTAC CAGCAGTTCA ATAA
 
Protein sequence
MGRRWLKLVA LCAIFLLGSI WLPAASAETS VVTLNDLALG TTSSISNDTM RIVHNGLAYF 
TANDQSNGWT LWRSDGTAAG TFRLTTAQQT NIIPQYLVGY GAYAYIATYT HTDGYSIWKT
DGSPNSISKV VSFDSSSVTT GYVKISLMQV AQNTLYFFFA YESMVEVWKL DQALQPVRIK
AFRASAMQFT VSSIDRLVEF NGAVYFLVST TDIPFWGYFS DLWRTDGTPE GTYSVQVIDG
RKYARVQGPV VAGGKLIFNS LLDGVVASNG YPNGSEPLFD TIEDGDAPMQ LVSVNGIGLL
TRYHGHRLWR TDGTVEGTYP IDVNPHGPDN LIFGPVVGNY LYFTAEHPSY GRELWRTNGT
LAGTSLVIDG IAGPTSSNPI NFATVGKQLY FTATNSQGGV QPWKLDCVGA NPQMIGPIDS
ISANANPGMY LETPQGIVFA ANTPALGHEP WLYRESGNTW LKSNAVVATA SDQVAAIPVT
IGNDGTIMQQ TLELTLILTD RVEYLSDTSG ITPTIQANSY AWKLDRIAAN CNEHSFVVYV
KLPSFSLNQH RPFTLQLSGM APGDSANDNQ VNGQLVVGTP LFLPAVQ
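
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below is one way to do that using only the Python standard library; the `translate` helper and the constant names are hypothetical and not part of this record. It assumes the sequence is read in frame from its first base, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it allows).

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame from the first base, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence into blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGGGTCGTA GGTGGCTTAA GCTTGTAGCA"
print(translate(first_blocks))  # MGRRWLKLVA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to the same function would allow the whole translation to be compared.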