Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0623 |
Symbol | |
ID | 5732521 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 718973 |
End bp | 720232 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277750 |
Product | hypothetical protein |
Protein accession | YP_001543399 |
Protein GI | 159897152 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGCAT TAGGCATTTT GGGCTTGGCC GTGGTGCTAT TTCTGACCGG CCTTTCGAGT AATATTTCTT TCTTTTATTA CTTAACCTAT ACCCTGCTGG GCTTATTGGT TGTGGCCTAT GCTTGGGCGT GGAGCAATCT CGAAGGGCTG GAGATCGAGC GCGAATCGAG CACTACTCGC GCCCAAGTTG GCGAAACTGT GCGCGAACGA ATTACGATCT ACAATACGTG GCCCGTGCCC AAACTCTGGG TCGAAGTGCG CGATCACTCC GATTTACCTT CGCATGGCTC GGGCTTTGTG AGCTATATTC CAGGCCGCGA AAAACGGCGC TGGATGGTGC GCACACCCTG TACCTTGCGC GGTAAATGGC GCTTGGGGCC AGTTTCGGTG CGCAGCGGCG ATCCCTTCGG TATCTTCCAA TTGCATAAAA TGGTCGATCT AACCCATGAT CTGTTGGTAT ATCCTGCCAC TGTCGATTTA CCAAAATTCG AGCTACCAAC CGCTGAGTTA ATCGGCGGGC AAGATGTGCG TTCACGCACC TTTCACGTTA CGCCGAATGT TTCGACTGTG CGCCAATATG TGCCAGGCGA TAGCTTCAAC CGCATTCACT GGCGTTCGAC AGCGCGAACT GGCAATCTCA TGGTCAAGGA ATTTGAGCTT GATCCATCAG CAGATGTTTG GATCATTTTG GATATGGAAG AACGTTTTCA CCATGGCCAG CCGGATGCAG CGATTCCAGC GATTATCAAA ACCTTGCAAG GCAAAATCGC AATTCCCAAC ACCACCGAGG AATATAGCAT TACCTTGGCA GCTTCGTTGG CACGGCATCT GTTGCGGATC AATCGCAGCG TGGGCTTATT GACCTACGGC GCACAACGCG AAATTATCCT GCCTGAACGC GAAGCTCGCC AGCTTTACAA AATTTTAGAG CCATTGGCGA TGCTACATGC CACCAGCAAT ACCTCATTAG CCGAATTATT GGCCGCTGAA AGCCAACGCT TTGGGCGTAA CTCCTCGCTG CTGATCATCA CCGCCGCGCT CGATGAACGT TGGGTGGCGG CAGTTCAGCG TTTGGTCTAT CGTGGCGCAC GCGCTTCAAT TATGTTTCTC GATGGTAAAT CGTTTGGCGG CTGGCGTGAC CCTGAGCCAA CCTTTGCCCG CTTGGCCGAG TTACGCGTGC CAGTCTATCG CATTCATGCT GGCGATGGTT TAGATCGCGC ACTGGCCGAG CCAGCTATTC GCCCAATGGG TCGGAGCTAA
|
Protein sequence | MRALGILGLA VVLFLTGLSS NISFFYYLTY TLLGLLVVAY AWAWSNLEGL EIERESSTTR AQVGETVRER ITIYNTWPVP KLWVEVRDHS DLPSHGSGFV SYIPGREKRR WMVRTPCTLR GKWRLGPVSV RSGDPFGIFQ LHKMVDLTHD LLVYPATVDL PKFELPTAEL IGGQDVRSRT FHVTPNVSTV RQYVPGDSFN RIHWRSTART GNLMVKEFEL DPSADVWIIL DMEERFHHGQ PDAAIPAIIK TLQGKIAIPN TTEEYSITLA ASLARHLLRI NRSVGLLTYG AQREIILPER EARQLYKILE PLAMLHATSN TSLAELLAAE SQRFGRNSSL LIITAALDER WVAAVQRLVY RGARASIMFL DGKSFGGWRD PEPTFARLAE LRVPVYRIHA GDGLDRALAE PAIRPMGRS
|
| |