Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0790 |
Symbol | |
ID | 5732674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 892730 |
End bp | 893761 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277920 |
Product | hypothetical protein |
Protein accession | YP_001543566 |
Protein GI | 159897319 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0823] Periplasmic component of the Tol biopolymer transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATTCG TGATTTGTGT TCTTGTTTGT ATTCTCTTGT CAAGTTGTGG GGCGGCTAAT TCGGCGACGG TAACCCCGCC AGCTGCGGTT GCCCCCAAGC TTGGCGATAT TTCGCTGATT ATCTCGCGGG GTGGGGATTT GTGGCGTTTC GATTTGCCCG CTAAACAGTG GACGCAATTA ACCAACGAAG CGCCCAATGC CTACTCAACC TTCCCTAGTA TTGCTCCCGA TGGCAAAAGC GTGGTCTATT CGTATCGGCC ACCTGTGCCA ACTCCTAGTG CCGAGCAACC CTTTGTTGTG CCCGTTAATC ATGCCAACCA AATTGCGGTT GATGGCGGAG CCAGTAAATC GTTGTTTGTG CCCGATGGCG GCAAGCGCGA TGGTCTGCAA ATTTACGATT CACTCGATAC GCCAGTTTGG TCGCCTGATG GCAAAACCTT GTATGGGGTG TATCAAACCT TGCGTTTTGA TAACGATGGG GTCTTTTTGC AATCAGGCAG CTCGATTGTA GCGATTGATC TTGCTAGTGG TGTGCAACAA ACTCTCGTCA TGACCGGAAC ATTTCCATCG GTGTCGCCTG ATGGCCGCCA ATTGGCCTTC GTGCGTACCC GCGACGGCAT TTACCCAACC TTATACCTCT ACGATTTACA AACTAAGCAA GAGCGGGTGC TGTTTGATCA GCCCAACTTG GTGAGTGCCT TAGAAGCACC AATCTGGTCG GCGGATGGCA AAACAATTTA TATCGCGGCC AGCCCTTTGA CAATTGGCCA ACATCAACCA AACTGGGTCG ATTGGTTTGT CACGCCAGCA TCTGCCCATG GTTTGGGCTG GCAAGTGTGG TCGGTTGATC TGGCGACAGG TCAAGGCAAG CCGGTCAATA GCGAAATCTT CGAAGATCCT CGGATCGTCG TTGATGGTTC GCTGTTGTAT GTTTGGACGT TCTCAGGCCT TTGGCAGATG GATTTGGTCG GTAATCCTCC AGTTTTAATC GAAGAGCCAG GCGATATCGG CGGTATGACG CGGGTTCCCT AG
|
Protein sequence | MRFVICVLVC ILLSSCGAAN SATVTPPAAV APKLGDISLI ISRGGDLWRF DLPAKQWTQL TNEAPNAYST FPSIAPDGKS VVYSYRPPVP TPSAEQPFVV PVNHANQIAV DGGASKSLFV PDGGKRDGLQ IYDSLDTPVW SPDGKTLYGV YQTLRFDNDG VFLQSGSSIV AIDLASGVQQ TLVMTGTFPS VSPDGRQLAF VRTRDGIYPT LYLYDLQTKQ ERVLFDQPNL VSALEAPIWS ADGKTIYIAA SPLTIGQHQP NWVDWFVTPA SAHGLGWQVW SVDLATGQGK PVNSEIFEDP RIVVDGSLLY VWTFSGLWQM DLVGNPPVLI EEPGDIGGMT RVP
|
| |