Gene Information |
Locus tag | Haur_3274 |
Symbol | |
ID | 5735142 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4137775 |
End bp | 4139538 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641280420 |
Product | hypothetical protein |
Protein accession | YP_001546039 |
Protein GI | 159899792 |
COG category | |
COG ID | |
TIGRFAM ID | |
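The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. Both are assumptions consistent with the values in this record, not documented rules of the source database. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS that includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4137775, 4139538)
print(length)                  # 1764
print(protein_length(length))  # 587
```

Both results agree with the Gene Length (1764 bp) and Protein Length (587 aa) fields listed for Haur_3274.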
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000551776 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGTA GGTGGCTTAA GCTTGTAGCA CTGTGTGCTA TCTTCTTGCT TGGCAGTATT TGGCTGCCCG CAGCTTCGGC AGAAACCTCG GTAGTAACCC TCAACGATCT TGCACTTGGA ACAACCTCAT CAATTAGTAA CGACACCATG CGAATCGTCC ATAACGGATT AGCCTATTTT ACCGCCAATG ATCAATCAAA TGGTTGGACA TTATGGCGTA GTGATGGCAC TGCCGCAGGG ACATTTAGGC TGACCACTGC CCAGCAAACC AACATCATTC CCCAGTATCT GGTTGGTTAT GGCGCATATG CCTATATTGC AACTTATACT CACACCGATG GTTACTCAAT CTGGAAAACC GATGGCTCGC CCAATAGCAT TAGCAAAGTT GTCAGCTTCG ATTCAAGCAG CGTAACTACT GGCTATGTCA AAATTAGCTT GATGCAAGTG GCCCAAAATA CACTCTATTT CTTCTTTGCT TATGAATCGA TGGTTGAGGT TTGGAAACTT GATCAGGCCT TGCAACCAGT CAGGATTAAA GCTTTTCGGG CGAGCGCAAT GCAATTCACC GTTTCAAGCA TTGATCGCTT AGTTGAATTC AATGGAGCGG TTTATTTCTT GGTTTCAACA ACCGATATAC CATTTTGGGG ATATTTCTCA GATTTATGGC GTACCGATGG CACTCCCGAA GGCACCTATA GCGTTCAGGT GATCGATGGG CGCAAATATG CTAGAGTTCA AGGGCCAGTC GTTGCCGGCG GTAAGCTGAT TTTCAATTCG TTGCTTGATG GGGTTGTGGC GAGCAATGGC TACCCCAATG GTAGTGAGCC ATTATTTGAT ACCATCGAAG ATGGTGATGC CCCAATGCAG CTTGTTAGTG TGAATGGGAT TGGCCTGTTG ACTCGGTATC ATGGTCATCG CTTGTGGCGA ACTGATGGAA CAGTTGAGGG AACCTATCCA ATTGATGTTA ATCCACATGG CCCTGATAAT CTCATATTTG GCCCAGTGGT TGGTAACTAT CTCTACTTTA CTGCCGAACA CCCGAGTTAT GGCCGCGAAT TGTGGCGCAC CAATGGCACG CTTGCTGGCA CCAGCTTAGT CATCGATGGC ATTGCAGGCC CGACGAGTAG CAATCCCATC AACTTCGCGA CAGTTGGCAA ACAACTTTAT TTCACTGCCA CGAATAGCCA AGGTGGTGTC CAACCATGGA AGTTGGATTG TGTTGGAGCC AACCCGCAGA TGATTGGCCC AATTGATTCA ATTAGTGCCA ATGCCAACCC TGGCATGTAC CTTGAAACTC CCCAAGGCAT CGTGTTTGCT GCGAATACCC CTGCTCTTGG CCACGAACCA TGGCTGTATC GCGAAAGTGG CAATACGTGG CTGAAAAGTA ATGCAGTAGT CGCAACTGCG AGCGATCAGG TTGCCGCAAT TCCGGTGACG ATTGGCAATG ATGGCACGAT TATGCAGCAA ACTCTCGAAC TCACCCTTAT TCTCACTGAT AGGGTCGAAT ATCTCAGTGA TACAAGTGGC ATTACCCCGA CAATCCAAGC CAATAGTTAT GCTTGGAAAC TTGATCGGAT TGCAGCAAAT TGTAACGAAC ACAGTTTTGT GGTGTATGTT AAGTTGCCTA GTTTCTCCTT AAATCAACAC CGTCCATTTA CGCTCCAACT AAGTGGCATG GCCCCAGGCG ATAGTGCCAA CGACAATCAA GTTAATGGCC AGCTGGTGGT TGGTACTCCC CTCTTCTTAC CAGCAGTTCA ATAA
|
Protein sequence | MGRRWLKLVA LCAIFLLGSI WLPAASAETS VVTLNDLALG TTSSISNDTM RIVHNGLAYF TANDQSNGWT LWRSDGTAAG TFRLTTAQQT NIIPQYLVGY GAYAYIATYT HTDGYSIWKT DGSPNSISKV VSFDSSSVTT GYVKISLMQV AQNTLYFFFA YESMVEVWKL DQALQPVRIK AFRASAMQFT VSSIDRLVEF NGAVYFLVST TDIPFWGYFS DLWRTDGTPE GTYSVQVIDG RKYARVQGPV VAGGKLIFNS LLDGVVASNG YPNGSEPLFD TIEDGDAPMQ LVSVNGIGLL TRYHGHRLWR TDGTVEGTYP IDVNPHGPDN LIFGPVVGNY LYFTAEHPSY GRELWRTNGT LAGTSLVIDG IAGPTSSNPI NFATVGKQLY FTATNSQGGV QPWKLDCVGA NPQMIGPIDS ISANANPGMY LETPQGIVFA ANTPALGHEP WLYRESGNTW LKSNAVVATA SDQVAAIPVT IGNDGTIMQQ TLELTLILTD RVEYLSDTSG ITPTIQANSY AWKLDRIAAN CNEHSFVVYV KLPSFSLNQH RPFTLQLSGM APGDSANDNQ VNGQLVVGTP LFLPAVQ
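The GC content field could be recomputed from the gene sequence shown above. This is a minimal sketch, assuming the sequence is pasted as-is with its 10-base blocks separated by spaces. `gc_content` is a hypothetical helper, and the record's own value may have been rounded or computed differently.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence.

    Whitespace (such as the spaces between 10-base blocks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence in this record.
print(gc_content("ATGGGTCGTA GGTGGCTTAA"))  # 0.5
```

Passing the full gene sequence instead of the two-block excerpt gives a fraction that can be compared against the reported GC content.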