Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1724 |
Symbol | |
ID | 5733611 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2006911 |
End bp | 2008611 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 641278866 |
Product | hypothetical protein |
Protein accession | YP_001544495 |
Protein GI | 159898248 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3666] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00874348 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATGC GTGCGGCGTT GTTTGGTGAC TATCCCATTC CTGAGGACAC CGTTGAATTG GCACACGCCA TTGCTCCACA TGGCAACCGA CTCATGCACC TCCGTGATCA CTTTGGCATG CTGTTTGACA ATCAGCAATT CAGCACGCTC TTTTCCCATA CTGGTCAACC GGCCCTCGCG CCAGCACGAC TCGCCATGGT CACCATCCTC CAGTTCATGG AAGATCTCCC CGATCGCCAA GCCGCCGATG CCGTGCGGAT GCGCATTGAT TGGAAATATG TCTTGGGCCT TCCGCTTGCT GATCGTGGCT TTGATGCCTC CGTCCTCAGC GAGTTCCGCG CCCGGCTTGT GGCGGGAGAT GGTGCGTCTA TCCTCTTTGA AACCCTGCTG GAGCGCCTTC GCGATTACGG ATTACTGCGA ACACGCGGGC AACAACGCAC CGATTCGACC CATGTCCTCG CGGCAGTTCG TGGCCTGAGT CGCATTGAAT GTCTTGGCGA AACGATGCGT GCCACCCTCA ATGCGCTCGC AACCGTGGCT CCCGCATGGG TCCGCTGCCA GATTCCACCG CCGTGGTTTG ATCGCTATGG GCCACGTGCT GATGCATATC GCTTTCCGAA GGCAGCCGCC GACCGTCAAC GTCTTGCCGA GCAGATTGGA GCCGATGGGT TTGAACTCCT CACCCTCCTT GCGGCACCGA CTGCCCCGCG TGAACTGCAC GTTCATCCCG CCGTATGCAT CCTTCGTCGC GTTTGGTGGC AACAATACCA TGCCCCCAAT GGACCAGTTC GCTGGCGTGA AGTGGCTGAT ATGCCACCGA GCAGGATGCG CATTCATTCG CCCTATGATC GCGATGCGCA GTACAGCACG AAACGCAATA TGGAATGGAC GGGATACAAA GTGCATCTGA CTGAAACCTT CGATGCTGAT CTGCCCTGCC TCATCACGCA TGTGCTTACC ACGCCATCCA CGATTCGTGA TGGTGAAGTC TTAGACCAGA TTCATGAGGG CCTTGCTCGC CATGATCTCC TTCCCAGCAC CCATCTTGTT GATACGGGCT ATACCGATGC AGCAGCGATG CTCACCAGCC AGTCCACCTA TGGGATTACA CTGTGCGGGC CAATTGCTCG CGATAGTGCC TGGCAAGCGA AAGACCCGAC CGCCTTCGAT ATCACGCGAT TTCAGGTCGA TTGGGACGCG AAGGTGGTCA TTTGTCCCCA AGGACACGCA AGTACCAAAT GGATTTCGCA TCAGGATCGA CACGGGAATC CTGCCATTCG CGTGACGTTT CGACCGCGTG ACTGCCGAGC GTGTCCAGTC CGAACACAGT GTACGCATAC GGCAACCGCA GCACGAGGCC TTTCGCTCCG CCCCCGAGAA CAGCATGAAG TCCTTCAGCA GCGGCGGCAC GCCCAAACAA CCGATGCCTT CAAACGGCAG TATGCAAAAC GGGCGGGAGT CGAGGGACTA ATGTCGCAAG CAACCCGAGT CTGTGGGATG CGGCAGAGTC GCTATGGTGG GATGGCGAAA ACGCGACTCC AGCATGTGCT GACCGCGTGT GCGCTGAATC TGCTGAGGAG TGTGGCATGG GTGACCGGTG GGTCGCGTCA CCAAACCCAA ACGTCGCGCT TTGTGGCCCT CCGTCCACCG CCTGCTCTGT CTCAGATACG TGAACAGACA CGGCTCCATA GCCACCAATG A
|
Protein sequence | MTMRAALFGD YPIPEDTVEL AHAIAPHGNR LMHLRDHFGM LFDNQQFSTL FSHTGQPALA PARLAMVTIL QFMEDLPDRQ AADAVRMRID WKYVLGLPLA DRGFDASVLS EFRARLVAGD GASILFETLL ERLRDYGLLR TRGQQRTDST HVLAAVRGLS RIECLGETMR ATLNALATVA PAWVRCQIPP PWFDRYGPRA DAYRFPKAAA DRQRLAEQIG ADGFELLTLL AAPTAPRELH VHPAVCILRR VWWQQYHAPN GPVRWREVAD MPPSRMRIHS PYDRDAQYST KRNMEWTGYK VHLTETFDAD LPCLITHVLT TPSTIRDGEV LDQIHEGLAR HDLLPSTHLV DTGYTDAAAM LTSQSTYGIT LCGPIARDSA WQAKDPTAFD ITRFQVDWDA KVVICPQGHA STKWISHQDR HGNPAIRVTF RPRDCRACPV RTQCTHTATA ARGLSLRPRE QHEVLQQRRH AQTTDAFKRQ YAKRAGVEGL MSQATRVCGM RQSRYGGMAK TRLQHVLTAC ALNLLRSVAW VTGGSRHQTQ TSRFVALRPP PALSQIREQT RLHSHQ
|
| |