Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0073 |
Symbol | |
ID | 5731946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 95172 |
End bp | 96338 |
Gene Length | 1167 bp |
Protein Length | 388 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641277195 |
Product | hypothetical protein |
Protein accession | YP_001542853 |
Protein GI | 159896606 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAGTA AAGTGAAATT TTTCGCGATT TTCATTGGCA TCGTCGCGTT GTCATTTTTC TTATTTCGCT CCTGCGATCC TCAAGCGCGT AACCTGCGGC CTGGCTCGAT GTACAACCAA GGTGGCGACG GAACTGCCGC TTTGCAGCTT TGGCTGAGCA AAATTGGCTA TCAAACTGAG TCATTTGAAT ATCGCGATTT CGATGAGCTT GACCAAACAA TCGATACTTT GCTTGTAATC AAGCCAAGCG ATACTAATAA TTGGCGCAAG GAAGAAATTG ATGCGGTTTT AGCGTGGGTT GAAGATACCG GCGGCACGCT GATTGTCGCT GATGATCAAC AAAATGGCCT TTTGACCCGC TTGGATCTGA CGGTCACCCG CATCGAAGCC TTGGAAATGG TCAGCACCAG TGATACCAGT CATGCTTTGG TGAACCCGAT AGTAAACGGT TTGCAAGGCT ATCAAACAAT TAGCTATTTT GAGCGAGTGA GACCCAATAG CCAAGTGATT GTTGGTAGCG AAGCACAGCC GACGACGCTT GGCATTAGTC GTGGCCGTGG GATGATCTAT GCTTCAACCA ATATTTGGCT CTTTACCAAT GCTGGTTTGT TTTATGAAAG CAATGCCAAA ATTATCCTTA ATATGGTTAA TCGGATGCCC GCAGGCAGTG TAATTGCCTT TGATGAGGTA CATCATGGCC GTGCCTTACC ACCCAAAGCC GCTCCTGTGC CAGCCCAACC CTATTCGCCT TTGGTTGCGG CGATGGTCTA TAGCGCAATG GTTGTGGGCT TATGGGCCTT GCTTTCTGGT CGCCGTTTTG GTCAGATTGT GCCCAGCAGA ATCGATTTGA TGCGACGGAA TAGCAGCGAA TATGTCCAAT CGATGGCCAA TTTGTTTCAG CGCGGTCGTC AAGCCGAACA TATGCAAGCC CACTATAAAA CCTATCTCAA ACGCCGAGTT GCTAAGCCCT ATGGGATTAA CCCCAAGCTT GATGACCAGA GCTTTTTAAG TGAAGTTCAG CGGTATTCCG ATACAATCGA TCGTAATCAC TTAGCTCATT TGCTCAACCA TTTAAGCCAG CCCAACCCCA GCGAGGCGAC GATCTTGGCC CTGGTCAACG ATATCGATCG CTTTATCAAC CTATGGGAAC AACAGGGTCG GGCCTAA
|
Protein sequence | MNSKVKFFAI FIGIVALSFF LFRSCDPQAR NLRPGSMYNQ GGDGTAALQL WLSKIGYQTE SFEYRDFDEL DQTIDTLLVI KPSDTNNWRK EEIDAVLAWV EDTGGTLIVA DDQQNGLLTR LDLTVTRIEA LEMVSTSDTS HALVNPIVNG LQGYQTISYF ERVRPNSQVI VGSEAQPTTL GISRGRGMIY ASTNIWLFTN AGLFYESNAK IILNMVNRMP AGSVIAFDEV HHGRALPPKA APVPAQPYSP LVAAMVYSAM VVGLWALLSG RRFGQIVPSR IDLMRRNSSE YVQSMANLFQ RGRQAEHMQA HYKTYLKRRV AKPYGINPKL DDQSFLSEVQ RYSDTIDRNH LAHLLNHLSQ PNPSEATILA LVNDIDRFIN LWEQQGRA
|
| |