Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4295 |
Symbol | |
ID | 5736154 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5482942 |
End bp | 5484849 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641281455 |
Product | hypothetical protein |
Protein accession | YP_001547055 |
Protein GI | 159900808 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGTT GGATTGTGTT GGCAATTTTC GTCGTCGGTT TAGGGCTTCC CCAAGCCACA CAAGCTACCG AATTGCAATG TTTTGAGCAA ACAGGCTTTT GTACCGATGG TCGTTTTTTA GAGTATTGGC GGCAAAATGG TGGCTTAGAA GTCTTCGGTT ATCCGTTGAG CACAATTGAC ATCGTTTACA ATCAGGATAG CCAGATGCAT TTCCTGACTC AGCAATTTGA GCGTGCTCGC TTTGAGTTTC ACCCTGAATT TGCCGCGCCC TACGATATGT TGCTTGGGCG GTTGGGCGAT GATCTGTTGC GCTATCGTAA TATCGATAGT GCCATGTTGC CACGTGAGGC TGGCGCAACA TCAGGCTGTT TGTGGTTTGA AACAACTGGA CACAACGTCT GTAACCAAGC CAATGGCCTC GGTTTTATGA GCTATTGGCA AAACCATGGC CTCAACGATC CCAAACTTGA TGCCTTTGGC CGTTCATTGC AATTATTTGG CTACCCGCTG ACTGAGCCAG CCATAGAAAC CAATGCGAAT GGCGATAGTG TACTCACCCA ACATTTCGAA CGCGCCCGTT TCGAGTGGCA TCCCAACCAA CCTGATCAAT TCAAAGTGCT GCTTGGCTTA GTCGGCAAAG AATCGCAAAA ATTAGTCTAT GGCGCAAGTG CCGATCCATC AAAATTGACC TTGGTTGGCG ATACGCTCTT TTTCACCGCC GACGATGGCG TGCATGGCCG TGAATTATGG ACAAGCGACG GCACCGAGGT TGGCACACGC TTGGTCAAAG ATCTGAGCGT TGGCACTGAG TGGGGTGGAA TTTATGAATT AGCCGCTGTC AATCAGGGCG TTATTTTTGC CGTCAACAAG CAGGATAAGG ACTATCAATT ATGGTATAGC GATGGCATTG AAGCTGGCAC ACGCTTAATC AAAAGCTTTG TTCCAAACGC TAAGATCAAC AATCTCAAAT CATTAGGCAA CGGGATCATC TTTTGGGCAA ATGATGCAGT TCATGGGATT GAACCATGGT ATAGCAACGG CACTGAAGCT GGTACATATT TGCTGAGTGA TATCAACCCA GGCCTTGCAG ATTCAGCAGT TTCCAACAAT TATGGATCTT ATTGGGTTGA CTATGCCCCC ATCGCGGGAG GTATGGCCTT TTTTGCCCAA AATAACCAAA TTGGCAATCA AATCTGGTGG ACAGATGGCA GCATTGCCAA CACTCGCCAA ATCAGCAATT TAGCAATCTC ATTTGGTATG CTTGAGCTAG AAGTGTTAGA TCAGCAACAT TTGATCGCAA CAGCCTATCA AAATACAACG ATGGGGGTTT GGAATATAAC GCTTGCTACT GGCGAACAGC AACTATTAGC CAGCTATCCT GCAATTGCCA CAACCCGTAA TCCAGCATCA GCTATACAAC TAACCCAAGC TGGTGGAAAG GTCTATTACC TTAGCAAAAC CCAAGCAGGC GAGCTTAGCC TATGGCAAAC TAATGGCCAA GCCGATCAAA CCATCCAACC CAATCTGCAA GGCTACAACG CCGAACATAT CGTAGCTGCA AACGATCAAT TGTATATGCG ACTGACTAAT CCTCAAGGCA TTCAGGCTGG CTGGTGGTAT TTCGATTCAA GCCAAGGGTT GACTCAACTA ACGCCATTAC CACTGCATAT CCATGCCGCG AACAATCGCC TATTGGGATG GGAATCGATC GCTGGAGGAT TACGGTTCTA TAGCACTAAT GGGCCAAATC AAGCCCTACG CTATCGTAGC TCAGTTATGG GCAAGCAGAC ATACTTCCCT GATACGACCA ATGATCGATT TTTCGCCATC CCTAGTTTTC AGTATGGCAC GGAGCTATGG TCTAACGATG GCAGCACCCT GCGGATGGTC AAAGATATTC AGCCATAA
|
Protein sequence | MKRWIVLAIF VVGLGLPQAT QATELQCFEQ TGFCTDGRFL EYWRQNGGLE VFGYPLSTID IVYNQDSQMH FLTQQFERAR FEFHPEFAAP YDMLLGRLGD DLLRYRNIDS AMLPREAGAT SGCLWFETTG HNVCNQANGL GFMSYWQNHG LNDPKLDAFG RSLQLFGYPL TEPAIETNAN GDSVLTQHFE RARFEWHPNQ PDQFKVLLGL VGKESQKLVY GASADPSKLT LVGDTLFFTA DDGVHGRELW TSDGTEVGTR LVKDLSVGTE WGGIYELAAV NQGVIFAVNK QDKDYQLWYS DGIEAGTRLI KSFVPNAKIN NLKSLGNGII FWANDAVHGI EPWYSNGTEA GTYLLSDINP GLADSAVSNN YGSYWVDYAP IAGGMAFFAQ NNQIGNQIWW TDGSIANTRQ ISNLAISFGM LELEVLDQQH LIATAYQNTT MGVWNITLAT GEQQLLASYP AIATTRNPAS AIQLTQAGGK VYYLSKTQAG ELSLWQTNGQ ADQTIQPNLQ GYNAEHIVAA NDQLYMRLTN PQGIQAGWWY FDSSQGLTQL TPLPLHIHAA NNRLLGWESI AGGLRFYSTN GPNQALRYRS SVMGKQTYFP DTTNDRFFAI PSFQYGTELW SNDGSTLRMV KDIQP
|
| |