Gene Information |
Locus tag | Haur_0545 |
Symbol | |
ID | 5732403 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 633661 |
End bp | 635367 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641277672 |
Product | hypothetical protein |
Protein accession | YP_001543321 |
Protein GI | 159897074 |
COG category | [S] Function unknown |
COG ID | [COG1479] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAAATA CCTATATCGA AAGCGAGACA GCGCCTATTG GGGATTTTAT CAACAAGGTT GAAGCTTTAA AAGTACCAGT CTTTCAACGA AGTTTTGCTT GGACGGAAGA TGAGATTAAG CTGTTATGGG ATGATATTAC AGCAATTATT GAATTACCTC AGCCAGAGTA CTTTCTAGGA TCAATGGTTT TCAAGAAGAA GGATGGTTAT TATGAGATAA TTGATGGTCA GCAAAGAATT ACTGCTATAT ATATTATATT AAGCTGTATA AGAAATATTT ATCGAATAAA TAGTGATACC CAACGAAACG AATGGTTTAA TAGGGAATAT TTTGGAAAAA TAGATATAAA TACATTAACT ATAAACTCTA AATTTCAAAT GAATGAAATT AATAATCCTT TTTTTCAAAA ATACATTGTT TCTGAATCAA ATATTGAAGA AATAGAAGAA AACATCAAAA ATCTCCCTAA GAAAGATACG AATCATCTTC TTGGAAAAGC AATCAAACAG ATTTATAGTT TGCTGATCGA GAGGCAAAAG GAATTTTCTG GTACAGAATT TAGTTCAGAT GCGCTCCTAT CTATACATAA TATTATACAA AAACATATAT ATGTATTAAT ATTAAAGGTA TCAGATGAAG CGGATGCCTA TACTATTTTC GAAACACTCA ATGATCGAGG GCTGGGATTA ACAACTATGG AACTGTTAAA AAATCATATA TTTGAAAAAT CAGGTATGTA TCTTCCTTCA GTACAAAATG AATGGTCTAT TTTACGCGAT AATCTTCTTC ATAGCGATCC GAGTGATCGA TTTATCCACC ATTATTGGAC ATCGATCCAT GGACGAACAT CAAAACCACA ATTATTTAGA TTGATGAGAA AACAAGTAAA TACTCCCATC CAAGCGGTAG AATTCGCACG AAGTTTGAGG ACATCGGCCA GATTATATGC GGCGTTTGGA AATCCAAATG ATGATTATTG GAATGACCAT GAAAAACAAA CTAGAGATAA CTTAGAAGTA CTTGATATTC TTGATGCACA GCAGGTTATG CCAGTTTTAT TAGCAGCAGT ACAAAAATTT AATCCAACTG AGTTTAGGAA ATTAACCTAT GCTTTGGTCG TTATGGCAGT ACGATATAAT TTAATTGGAG AATTACGGAC AGGTGTTATT TCAAACTACT ATTCTGATAT TCCGAAAAAG ATCCATTCTG GTGAGCTAAA TAAAAGTATA AAAGTATGCA GAGAATTCAG ATCGATATAT CCTACAGACG AAGAGTTTGA GGAGGCGTTC AGTACTAAGG TTCTAAGTGA TGCAAAGAAA TCACGTTATT TACTTATAGA GTTAGAAAAA AATTTTAACG GAGGAAGCTC ACAGGTTTCT AATGATCCGA GAAAGGTTAA TTTGGAGCAT ATTTTACCAC AGAATCCATC AAATGAATGG ATTGATACAA TAACACAGGC AGGTGACGAT TTACGGAAAT ATATCTATAA ACTTGGTAAC CAAGCGCTCG TTTCAACAAC GCCAAATAAG AAATCAGGAG CAAAAGGATT TATTTACAAA AATACTAATC TCTATAGCAA GGAAACACAA ATTTATTACA CTCATATGCT TACACAATAT AGCACATGGC TTCCTGCAGA TATTATTAGA CGCCAAAAAG AACTGGCAAA GCAAGCCGTA AAAACATGGA GGATAGAATT CTCCTAA
|
Protein sequence | MGNTYIESET APIGDFINKV EALKVPVFQR SFAWTEDEIK LLWDDITAII ELPQPEYFLG SMVFKKKDGY YEIIDGQQRI TAIYIILSCI RNIYRINSDT QRNEWFNREY FGKIDINTLT INSKFQMNEI NNPFFQKYIV SESNIEEIEE NIKNLPKKDT NHLLGKAIKQ IYSLLIERQK EFSGTEFSSD ALLSIHNIIQ KHIYVLILKV SDEADAYTIF ETLNDRGLGL TTMELLKNHI FEKSGMYLPS VQNEWSILRD NLLHSDPSDR FIHHYWTSIH GRTSKPQLFR LMRKQVNTPI QAVEFARSLR TSARLYAAFG NPNDDYWNDH EKQTRDNLEV LDILDAQQVM PVLLAAVQKF NPTEFRKLTY ALVVMAVRYN LIGELRTGVI SNYYSDIPKK IHSGELNKSI KVCREFRSIY PTDEEFEEAF STKVLSDAKK SRYLLIELEK NFNGGSSQVS NDPRKVNLEH ILPQNPSNEW IDTITQAGDD LRKYIYKLGN QALVSTTPNK KSGAKGFIYK NTNLYSKETQ IYYTHMLTQY STWLPADIIR RQKELAKQAV KTWRIEFS
|
| |
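The coordinates and lengths in the Gene Information table are mutually consistent, which can be checked with a small sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS is complete and ends in a stop codon (the gene sequence above ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration only; they are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon.

    Assumes the CDS is unspliced and its length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Gene Information table for Haur_0545.
length_bp = gene_length(633661, 635367)
print(length_bp)                  # 1707, matching "Gene Length"
print(protein_length(length_bp))  # 568, matching "Protein Length"
```

Because the gene lies on the minus strand, the listed gene sequence would be the reverse complement of the replicon sequence over these coordinates; the length arithmetic is the same on either strand.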