Gene Information |
Locus tag | Haur_0418 |
Symbol | |
ID | 5732317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 486955 |
End bp | 488607 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277544 |
Product | hypothetical protein |
Protein accession | YP_001543197 |
Protein GI | 159896950 |
COG category | |
COG ID | |
TIGRFAM ID | |
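The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(486955, 488607)
print(length_bp)                  # → 1653
print(protein_length(length_bp))  # → 550
```

Both values agree with the Gene Length and Protein Length fields of the record.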
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000875198 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTCAGG CTGTGCTACC ATGCCGCACT ATGGAGATGC AAAACTTGCC ACCAAGTGGC CCACCAACTC AACGAGCAAG GTTGCTCACC GCCCTACGTT CAAGTCGTGC GATGCAAAAA ATTCGTCAAT ACCAGACCGT CTTGTTGGCT GGTACGTTCT TTGTTTTAGC TGCAATTTTT GGTCTCAATT CGCAATCGAC GATCGAATCG CCCTCTGGTG CGGCTAGTCC GACAAGTTTG GCTCAGGCAA CTTTAGCTCC AGCTGATGGT TCAATTACGC CAACCGTGGC AACCAGCACC GAAGTTGCCA ATCTACGCCC AACACCCACC GATGTGGCGG GTGTGGTCGG GCCGCCAGCG CCTAATCGGG CAACGCCAAC CAATGAGGGT GCGTATCCTC CGCCAGTTGG TCAAGGGCCA ACTTCGACGG TTGATCCGTT TCCAACCTTG CCAGTTGGTG GCACGACGAT TCCACCATTC CCAACCAGTA GTACGGGCAA TCGTGGCACG GCTGCGCCAA CCCAAAGTTC GGGCGGTTAT CCTACGCCAA GTTCAAATAG TGCAACTCCA GCCCCAACCA ATATTCCGGT GGCAACCGAT GAATTACCCT TCCCAACCGA GGAGGGCGAA GATCCAAAGC CAACCGAAGA TCCATTTGAA GAAACTCCCA GTGCAACCGA GGAGCCGTTT GAAACCCCAA CCACCACGCC AACCGAAGGG CCAACCGCGA CCCCAACCAA TACGCCAACG CCGACACCAT TGCCTTACGA TCTGATTCGT GGCAATACGC GCTGGACATT GGCTCAAAGC CCAGTCAGAA TTCGCCGCGA TACGATCATC GCCAAAGGCG CAAGCTTGAC GATTGATGAT GGTGTGGAAG TATTGCTCGA TGCCAATACA TCGTTGGTGG TTGATGGAAC ATTGACTGCT AATGCAGCTC GTTTCCGCAA GAGCGGCAAC AGTTTCTGGA AATCGATCGT GGTTAATAAT GGTGGCCAAG CAAATCTGAA TTGGGTTGAT ATGCGCGGCG GTGGCTCCGA AGGCGTGTTG ATTTCAGCCC TTGGTGGCAA TACGGTCATC CAAGATAGCG TCTTTGAAGA GAACAAAGGC CGCATCTATA TTAGCGGTGG CAACTTTGAT ATGCAACGCA GCCGCGTAGT TGGCTTTGCT CCAATCAGCG CCGAAGTGCG TGGCGGCAAG AGCCTGCGCT TATTCAGTAA TGTGATTAAC AATACAGCCA CCGATGGCGC AACTGGCGTG AGTTTGAGCG CAACCGCCGA TGATGTGGAA ATTGTCTTGG AGAAGAACAG TTTCCGTGGT AGTAGCGGTA CGAACGTTCG CGCCCAATTC AATCAATCAT TGAATGCTGT GTTTCAGTGC AATAGCTTTA GCGGCGGAGC CTATGGCCTG CAAATCAAAT CAACTGACCC AACCTTAGAT GGTTCACGGA TTTTGATTAG CGGCAATAGC TTCCAAAGCC ACAAAAACTA TGGCCTAACT GGCGATGTTG GCTTTGATGC CCGCAACAAC TGGTGGGGCG ATGCCTCAGG CCCATATCAT CCTGAGCAGA ATGGCGCTGG CACCGGCGAT GCGGTTGGGG TCAATTTGAC CTTCAGTCCA TGGCTGAACG CCAAACCAAG TTGCGCTCCT TAA
Protein sequence | MAQAVLPCRT MEMQNLPPSG PPTQRARLLT ALRSSRAMQK IRQYQTVLLA GTFFVLAAIF GLNSQSTIES PSGAASPTSL AQATLAPADG SITPTVATST EVANLRPTPT DVAGVVGPPA PNRATPTNEG AYPPPVGQGP TSTVDPFPTL PVGGTTIPPF PTSSTGNRGT AAPTQSSGGY PTPSSNSATP APTNIPVATD ELPFPTEEGE DPKPTEDPFE ETPSATEEPF ETPTTTPTEG PTATPTNTPT PTPLPYDLIR GNTRWTLAQS PVRIRRDTII AKGASLTIDD GVEVLLDANT SLVVDGTLTA NAARFRKSGN SFWKSIVVNN GGQANLNWVD MRGGGSEGVL ISALGGNTVI QDSVFEENKG RIYISGGNFD MQRSRVVGFA PISAEVRGGK SLRLFSNVIN NTATDGATGV SLSATADDVE IVLEKNSFRG SSGTNVRAQF NQSLNAVFQC NSFSGGAYGL QIKSTDPTLD GSRILISGNS FQSHKNYGLT GDVGFDARNN WWGDASGPYH PEQNGAGTGD AVGVNLTFSP WLNAKPSCAP
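The relationship between the two sequence fields can be illustrated with a small translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard amino acid assignments, which translation table 11 shares with table 1; the two tables differ only in which codons may act as start codons, and that does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acid assignments in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCTCAGG CTGTGCTACC ATGCCGCACT"))  # → MAQAVLPCRT
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.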