Gene Information |
Locus tag | Haur_1814 |
Symbol | |
ID | 5733672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2109001 |
End bp | 2110665 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641278957 |
Product | FG-GAP repeat-containing protein |
Protein accession | YP_001544585 |
Protein GI | 159898338 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
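The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


# Values taken from the Haur_1814 record above.
gene_len, protein_len = cds_lengths(2109001, 2110665)
print(gene_len, protein_len)  # 1665 554
```

Both results match the Gene Length (1665 bp) and Protein Length (554 aa) fields of the record.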
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000122942 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGACCA CAATGGTTTT TCGGCGTGCC GTGCTGGCGA GCATCGGTAC AATTGTCGTG AGCAGTTTAT TGGTAGTTAA TGCCAATAGC CGAGGCTTGC TGGCCCAAAC GCTGCCTAAT CCAATTGATT TTCATCCAGT GGTGACCTAT AGTTCACCAA GCCGCCCATG CGAATTGGGT CGTGGTGATT TCAATGGCGA TGGTTTTGTC GATCTAGCGA CGGCAAATCA GGCAAGTAGC GAAGTTCAAA TATTTTTAAA TAATGGAGCC GGCGCTTTCC CAACGCACAC CACGTATAGT GTTGCTACTC CTTGTGGCAT CGATGTCGGT AATGTTGACG GTGATAACGA TTTAGATATT GTCGTGACCA AGCAAACCTC AAATCAACTA GGTGTATTGC TCGGTAATGG CGATGGCACG TTCCAAATTG CCCAATCGTT TAGCACTGGT GCACGCCCAA CCGACGTAAT CTTGCGTGAT CTAAATCAAG ATACTGAACT TGATGCGGTT ATAACCAACC AAGATAGTCA AGGGGTTAGT ATTTTGTTGG GCAATGGCAA TGGAACCTTT GCTAATCAAA CGATCTACAC GGTTCAGGCT TCGCCAACGC TTGAGGCAGT TGGTGATTTA ACTGGCGATG GGTATGCTGA TGTGGTTGTG GCCAATGCTG GTAGTGATAG TGTCAGCGTG TTGATTAATA ATGCCAATGG TACGTTTAGT TCTGCGGTTC ATTATGGGGT TGGCAATATT CCCCATAGCG CTGGGATTGG CGATATTGAT GGCGATAATG ATAACGATAT TATCGCGGTT AATCGTTGGG AACAAAGTAT GACTCGCTTG ATCAATAATG AGAGTGGTAG CTTTACGCCG CTTGCTCCAA CGATTTTCTT GCAAGGCCCA AGCGATATTG AAATAACTGA TCTTGATGGA GATGGCGTGC TGGATATTCT GGCAACCAAT ACGGTTAATG ATGTTGATCC TGGCACGGTC AGTATTTATT TTGGCTTAGG CAATGCCAAT TTTAGCAGCC CACAACTGGT AACCTCAGGC GTGCACCCAA CGTCGTTAAT CTATGCCGAT TTGAATAATG ATGGTTTGGT TGATATTGCG ACTTCAAACT TTTATGGCAA TAGTATCAGC GTTTTGTTAC GGCGAGTTCC TGCTGCAACC AGCACGCCAA CCATAACCCC GACGGCCACG AGCACACCAA CCGCAACCAA TACACCAACG GCCACGCCAA CGAGCCAACC AAGCGTGACT CCGGTTGCTG GTAGTTCAAC GACCTTTTTG CCGTTGGTTA CTGATAGTCG CCCAATCTTC CCAATCGTGA TTAATGCCGT GGCTCAGCCG TTGATTCCAA TCACTCAACA AGGCCAAATT TATTACACCA CGACATTAAC AATCAATACT CCATTGCCAA CGACTGGGCG CTTCTATCTT TCATCGCGTC CTGACGCGAT TGCCGAAGTT CGGGTTGATG ATCAAATGAC GGTTTGGGCT GATAATGCGG TGTTGTACGA ACGTAGCTTA ACAACGCCCC AAGTTGTTGA GATTTCGCGC AGCGAGTTGA CATCATGGCT TGATCAAGAG CTAACCATCA CCTTCCGCGA TGTAGCTGGC TCGGTTTACG GCAATAGCGC GGTGTGGTTG ATTTGGGTTC CTTAG
|
Protein sequence | MKTTMVFRRA VLASIGTIVV SSLLVVNANS RGLLAQTLPN PIDFHPVVTY SSPSRPCELG RGDFNGDGFV DLATANQASS EVQIFLNNGA GAFPTHTTYS VATPCGIDVG NVDGDNDLDI VVTKQTSNQL GVLLGNGDGT FQIAQSFSTG ARPTDVILRD LNQDTELDAV ITNQDSQGVS ILLGNGNGTF ANQTIYTVQA SPTLEAVGDL TGDGYADVVV ANAGSDSVSV LINNANGTFS SAVHYGVGNI PHSAGIGDID GDNDNDIIAV NRWEQSMTRL INNESGSFTP LAPTIFLQGP SDIEITDLDG DGVLDILATN TVNDVDPGTV SIYFGLGNAN FSSPQLVTSG VHPTSLIYAD LNNDGLVDIA TSNFYGNSIS VLLRRVPAAT STPTITPTAT STPTATNTPT ATPTSQPSVT PVAGSSTTFL PLVTDSRPIF PIVINAVAQP LIPITQQGQI YYTTTLTINT PLPTTGRFYL SSRPDAIAEV RVDDQMTVWA DNAVLYERSL TTPQVVEISR SELTSWLDQE LTITFRDVAG SVYGNSAVWL IWVP
|
| |
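The gene and protein sequences above are displayed in blocks of 10 characters, so whitespace has to be removed before the two can be compared. The following is a minimal sketch of that comparison using only the Python standard library. It builds the codon table by hand, assuming that translation table 11 assigns the same amino acid to every codon as the standard code (the two differ only in which codons may act as starts). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database API.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: table 11 codon-to-amino-acid assignments equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Whitespace (such as the 10-character blocks used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt and last 15 nt of the Gene sequence field, copied as displayed.
print(translate("ATGAAGACCA CAATGGTTTT TCGGCGTGCC"))  # MKTTMVFRRA
print(translate("ATTTGGGTTC CTTAG"))                  # IWVP*
```

The outputs agree with the first and last residues of the Protein sequence field, and the terminal `*` corresponds to the TAG stop codon that ends the gene sequence.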