Gene Information |
Locus tag | Haur_4903 |
Symbol | |
ID | 5736739 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6240018 |
End bp | 6240905 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282070 |
Product | hypothetical protein |
Protein accession | YP_001547661 |
Protein GI | 159901414 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | |
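
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which matches the listed 888 bp) and that the protein length excludes the stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp):
    """Codon count minus one, assuming a single terminal stop codon and no splicing."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(6240018, 6240905)
print(length)                              # 888
print(expected_protein_length_aa(length))  # 295
```

Both values agree with the Gene Length and Protein Length rows of the record.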
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.190776 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTAC AAGAACTCAT GCGCCAAGTG CGCTTCATTG AATTGAAAAC GACCAAGTTG GTAACTGGTG TGTTTGCTGG CATGTACATG AGCAGCTTCA AGGGGCGTGG CGTTGAGTTC GATGAAATTC GCGAATACGA GCCTGGCGAT GATGTCCGTG CGATCGATTG GAATGTTACG GCGCGGACAG GTCGTCCGTT TATCAAGCGC TTCGTCGAAG AGCGCGAGAT GACCGTGATG CTGTTGGTCG ATATGAGTGC TTCGGCAGAT TTCGGTACCA CCCGCAAACT CAAACGCGAA CTGGAAGCCG AATTATGCGC AACCTTGGCA TTTTCGGCAG TACGCAATAA CGATCGCGTG GGCATGCTGC TGTTTACTGA GGAAGTTGAA CGCTTTATCC CGCCGCGCAA AGGCCGCAAC CATGTGATGC AAATTGTGCG CTCGTTGCTC ACAACCGACC CTGAGCATCG CGGCACCAAC ATCACCAAAG CCTTAGATTA CCTCAATAAC GTGGTCGAAG GCAAAGCCTT GGTCTTTATC ATCTCGGATT TTCGCTCGCT CGATAATTGG ATTCGACCGT TGCGGATTAC CGCGCGTCGC CATGATGTCG TCGCAGTGCG CGTAGAAGAT CCACGCGAAC GTAAATTGCC CAAGGTTGGC TTGGTACGGC TACAAGATGC TGAAACCGGC CAAGAGTTGG TCGTCGATTT GCGTAATGAA AAATTACGCG ACATGTTTGA AAAGCAGGCC GAAGCTCAGC ACCAAAGCCA TGTGGCCGAA CTACGGGCCT TGGGCGTTGA TCATATGAGT TTGCAAACTG ATGGGCGTTA CGCCGATTCG CTGCAAAGCT TCTTTAATCG TCGGATGAAA CGCCGTGAAC GCGGCTAG
|
Protein sequence | MNLQELMRQV RFIELKTTKL VTGVFAGMYM SSFKGRGVEF DEIREYEPGD DVRAIDWNVT ARTGRPFIKR FVEEREMTVM LLVDMSASAD FGTTRKLKRE LEAELCATLA FSAVRNNDRV GMLLFTEEVE RFIPPRKGRN HVMQIVRSLL TTDPEHRGTN ITKALDYLNN VVEGKALVFI ISDFRSLDNW IRPLRITARR HDVVAVRVED PRERKLPKVG LVRLQDAETG QELVVDLRNE KLRDMFEKQA EAQHQSHVAE LRALGVDHMS LQTDGRYADS LQSFFNRRMK RRERG
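
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard table (the two differ in permitted start codons, which does not matter here because the listed sequence begins with ATG), and that the sequence as listed is already in coding orientation, since it starts with ATG and ends with TAG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 18 bases of the gene sequence listed above.
print(translate("ATGAACTTAC AAGAACTCAT GCGCCAAGTG"))  # MNLQELMRQV
print(translate("CGCCGTGAAC GCGGCTAG"))               # RRERG
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content rows to be checked in the same way.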