Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0921 |
Symbol | |
ID | 5732690 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1053248 |
End bp | 1054444 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278053 |
Product | hypothetical protein |
Protein accession | YP_001543697 |
Protein GI | 159897450 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTAACC CAGCTTTAAC TCTGAATGAT TTTGCTGCGC TTGAGGCCAG CACCGACTAT CGCCCGCACG TTGTTGATGA TGTGCAAGTT TCGCCATTTA GCCAACAGCA AACCAACCAA TATGTGGTTG GCCGCCCAGC TTCGCCATCG TTTATGCGCA CCAATGTGGT TGGCGTGCGG ATTTTGGAAT TGCTGAATGG TCAGCGTTCA ATCGAGCAGC TACAAACCGA AATGCAACAA CGCTATGGCA TTGAAGTCAG CATTGAAAAG ATTCGCCATT TTTGCGATTT ATGCGCTCGT CACCGTTTAT TAGTCCATGA AACCTGGGCC AACACAACTC CTGATTCAGA AGATGCCCCA CGTTTGGGCC GTCGCAGTGG TTTCTATTGG ACGCTGATTC GCGGCGATAG TTTTATTGAA CGGATTGTAG CTTGGCAGCG CTGGTGGTGG AATCCAGTCA CTTGGTTGTT GATGCTTGGT TTGTTGGGTG TGGGCTTGAC CTACATCGTA ACCAAAGGCG TGATGTTGGT TTCGCCCTTG ATGCTAGCCC ACCCCGATGC CAGCAAGCAC CTGATTTTCT GGCTGTTTGG GCTATTTCTG GTCGAGATTG CGACTCACGA ATTGGCCCAC GCTGTGGCTT GTCGCATGGC TGGAGCCAAG CCAGCTGGCT TTGGGATGGG CTTGCTCTGG TGGTTTATCC CGATTTTCTT TACCGATACC AGCGATATTT ACCGCGTGCC GAGCAAATAT CGGCGAGCTT CGGTGGCAGC GGCTGGCCCC TTAGTCGATG CGCTGTGGTT TGGGGTCGTT GCTAGTTTGC TTTGGCTTTT GCCGCGCGAT AGTTTGGCCT ATGAAATTGC TTTTGCCTAT AGCGGCATCC CAGCCTTGCT CTGTTTGATC AACCTCAATC CATTTGTTTC GCGCATGGAT GGCTATTGGA TTATTACCAA TTTACTGGAG CAGCCCAATT TGCGCCGCCG TACTGTGCGC AACATCGTCA ACAACATACG CGGTTGGTTT GGCGCTGCGC CGATTATCGA CCCGCTGAGT GAGCAACAAT CAGGGCGATG GCAATGGGCC TATAACATGT ATGCCTTAGT TTCATTGCTC TGGACGGTTT TCTTTATGTT CAACATCGGC TTTCATTTTG CCAGCTATGG CTTCACGATT CTGCGAATGT ACGCCGCAGC ACTATAA
|
Protein sequence | MTNPALTLND FAALEASTDY RPHVVDDVQV SPFSQQQTNQ YVVGRPASPS FMRTNVVGVR ILELLNGQRS IEQLQTEMQQ RYGIEVSIEK IRHFCDLCAR HRLLVHETWA NTTPDSEDAP RLGRRSGFYW TLIRGDSFIE RIVAWQRWWW NPVTWLLMLG LLGVGLTYIV TKGVMLVSPL MLAHPDASKH LIFWLFGLFL VEIATHELAH AVACRMAGAK PAGFGMGLLW WFIPIFFTDT SDIYRVPSKY RRASVAAAGP LVDALWFGVV ASLLWLLPRD SLAYEIAFAY SGIPALLCLI NLNPFVSRMD GYWIITNLLE QPNLRRRTVR NIVNNIRGWF GAAPIIDPLS EQQSGRWQWA YNMYALVSLL WTVFFMFNIG FHFASYGFTI LRMYAAAL
|
| |