Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3221 |
Symbol | |
ID | 5735089 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4075136 |
End bp | 4076680 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280367 |
Product | hypothetical protein |
Protein accession | YP_001545986 |
Protein GI | 159899739 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.135564 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACAAC GAATTAATGT CGGTATGAAT CCAACCGTAG CGGTCCGCAT TTGTGAAGGT AGTTTGCGGG TAATTGCCCA CGATTTGCCC GAAGCTTGGT TTGATGTTGA TGAAGATGAA ATTAGTTTTC GCGCCCAAGA AGGCCATCTC TCAATCGAGC GTTGCTCCGA TGATTTGGAA TTACGCTTGC CCCATGGCGC AGTTTTAACC ATTGATGTGG TTCAAGGCGA TGTTGATCTG ACTGGTTTGA GCGCTGTGCA TACCCGCCAA ATTGAAGGCG ATATTAGCGC CCGTGATGTG CAAACCTTTG AAAGTGATTC GGTGCACGGC GATGCTAGCT TTACTAAAAG CGCCAAACTT AGCCTTGCCA ACATTGATGG TGATCTCAAA ATCTATGAGA ACACTGATAC TGTGGTGATT AAAAATGTCA ATGGCGATGC CAAAATTAGC CAAGCCCACA ATTTGACGAT TATCAATGTG AATGGCGATT TGAGCGCAAG CGATCTGTTT GGCGATGTAG CGATTACCCG CGTCAGTGGT GATGCTACGT TGCGTGGCGC GATCAAGAGC CTTGCGCCAA TCCATGTTGA TGGCGATTTG AAATTGGGCA TCAATTGGCT ACCTGAGCAA GTCTATCGTG CTAGTGCCAA TGGCGATGTT GTGTTAGAAG TTGCCGATGA TGCTAATTTG AGCGTCAATG GTTTTGTCCA AGGCGATGTC TCAGGCATGG GCGATCGTGA GCCTGGCTCG ATTAGCCTGA CTTGGGGTAC GGGTACAGCT CGCTTAGAAT TGAATGTGCA AGGCGATTTG AGCATTCGGG GCGGAGCTGC TAGCCATAGC CACTCCAAGA GTGTTGGTGG CACAAGCTGG AACACCAACT GGAATTGGAA TAACGATGAT TTTAACCGCG CTATTCGCGA TTTCACCGAT GATTTAGCCT CGATGGGCCG CGATATTGCC GCTCAATTCC GTGAAATGAG CCGCGATTGG CGTGATGGCA AGGGCGAACG TACCGCTGAA CGTGCTCGCC AAGCCACTGA ACGCGCCGCC GAACGTGCTG CCAAAGCCGC CGAACGCATG AGCGTGCGGA TCAACGAACG CGAATATCGC TTTGATCCTG AGCGAATCGA GCGCTTGAAA GAGCAAGCCA AGCGGGCTGC TGATGAAGGC ATTAGCCGTG CTTACGAAGC AATTGGCCAA GCCTTGGGCA ATATCGAAAA AAATATTGCT AACCCAAATG CGCCCCGCCC GCCAGCCCCA CCAGCGCCGC CAGCCGCACC GCATGCGCCT CAAGCACCAA ACGCCCCGCA TCGGGTGTCG ATTAGCGAAG ATGATGGCTC TTCATCAAGC CAACATGTGG CCTACACTGG CGATACGGTG CGGATTTCGC CAGAGCAAGC TGCCGCAGCC CAAGCTGCCA CCGCAGCCCC AGCCGAAACG CCAGCCGTCG ATAAAACCCA AGAACGCTTG GCAATTTTGA AGATGGTGCA AAGTGGCAAG ATTAGCGCTG ACGAAGCGGC ACTCTTGCTC GAAGCCTTAG GCTAA
|
Protein sequence | MQQRINVGMN PTVAVRICEG SLRVIAHDLP EAWFDVDEDE ISFRAQEGHL SIERCSDDLE LRLPHGAVLT IDVVQGDVDL TGLSAVHTRQ IEGDISARDV QTFESDSVHG DASFTKSAKL SLANIDGDLK IYENTDTVVI KNVNGDAKIS QAHNLTIINV NGDLSASDLF GDVAITRVSG DATLRGAIKS LAPIHVDGDL KLGINWLPEQ VYRASANGDV VLEVADDANL SVNGFVQGDV SGMGDREPGS ISLTWGTGTA RLELNVQGDL SIRGGAASHS HSKSVGGTSW NTNWNWNNDD FNRAIRDFTD DLASMGRDIA AQFREMSRDW RDGKGERTAE RARQATERAA ERAAKAAERM SVRINEREYR FDPERIERLK EQAKRAADEG ISRAYEAIGQ ALGNIEKNIA NPNAPRPPAP PAPPAAPHAP QAPNAPHRVS ISEDDGSSSS QHVAYTGDTV RISPEQAAAA QAATAAPAET PAVDKTQERL AILKMVQSGK ISADEAALLL EALG
|
| |