Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1205 |
Symbol | |
ID | 5733098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1387981 |
End bp | 1389594 |
Gene Length | 1614 bp |
Protein Length | 537 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641278345 |
Product | hypothetical protein |
Protein accession | YP_001543981 |
Protein GI | 159897734 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000167007 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGATCAT TCCGTGCTGG CTATCGTTTA CTAGCATTAT TGCTGCTCCT AACGCTTGGT TTTGCACCTG CTGATCAAGC AACCTTGGCC CAAACACCAC CAATTGGCTC ATTAGTTGAT GCCAATGGTG ATGTTGGTGA GACGAGCGAT CTTGTAATTG GTCAAGACGG CCTAGGGTTG ATCGCCTATT ACGACACAAC AAATCAAGAT TTAAAAGTTG CCCACTGCAA AAATTTACAA TGCAGCCAAT CGAGCAGAGC GATTATTGAT AGCCAATATA ACGTTGGCAA ATATGGATCA ATCGTGATTG GCAACGACGG TTTAGGCGTT ATCGCTTATA TGGATCTGAC CAATCGCACA TTAAAAATCG CACACTGCCA AAATGTTGAA TGTAGTTCAG CAACAATCTC TACGATAACC TCAACCCTGT GGGTTACCTT TAAAACGGAT ATTGTCATTG GCGCTAATGG GTTTCCGGCA ATTCTCTATG CTGAAGATGA TGCTATTGCG ACGTTAAAAT ATGTACAATG CCATAATATC ACGTGTAGTG TATCAACAAT TACAACTCTT GATGAAGAGG TTTATGGATC ATTTGCAGGA AAAATGATCG TAGGTGGTGA TAATAAAGTC ATTATTGCAT ATAATAAATT GATCCATCAT TTATCAGTTG AATTAAAGTT CAAGTATTGT AATAATGAAG CTTGTAGTGA TGTTACAACT ACACAAATAT ATATAAATCC CGCAGACATA TTCTCTTATA TCCTTGATAT CACCATAAAT TCAAATAACT ACCCTATTAT CAGTTTTTAT GATATTGTTC AGGATCACTT ATTAGTTGCC CAATGTAATA CATTTAATTG TAGTGATTAT ACAATTAACC TTGTTGATAG TATAGGCGCT GAAAGTGGTA CAATGAGTAG TATTATGATT GGATTTGATG GATTACCTCT GATCGTTTAT CATATACGGC CAATTGCTGG TGTGCATGAT AATGACGATT TACGAATCGC ACATTGTGAG AATATTGCAT GCAGTAGTGC GACGATTACG ACGATTACAG CACATCTCGG TACATTCCTC GGGTATCATC CCACAATAAA AAAAGGGGCA GACAACTTAG GACTATTTAT TTATTTTGAT TTGATTAATA CCAATCTGCG GGTTGTTCAT TGTGCCAATA TTCTTTGCAC AAAAGAAGGC CAAGCAGACT TTGCGCAATA TTTGGCATTA GTTCAAGCTC CTATAACCAC GTTTAGCCTC TTAATCAATC AACAAACCAT TCCTGTTCAA CCTGTCGCCC AACAAGGCCA AGTATTTTAT CAAACAACCA TTAATGTTCC AAGCATTGTG CCAACTGGCG GTTCGTTTGT GCTAAGTGCT GATCCTGATG GGCAAACGCC TTCACTGGTT GATGATGCCG TGGTATTCAA ACTCAATAAT CAAGAAGTGT TTCGCTTTGA ATATACTGGT AGCATCGCAC CCTCCCCTAC TTTAGTCACA ATTCCCAATT CATTGGTTGC GCAATGGGCC GGCCAAACGC TAACTGTCGA ATTCCACGAT CGGTTTGCAG GCCAAATTCA AGCCTCAACC ATGTACTTAG TTTGGCTACC CTAA
|
Protein sequence | MRSFRAGYRL LALLLLLTLG FAPADQATLA QTPPIGSLVD ANGDVGETSD LVIGQDGLGL IAYYDTTNQD LKVAHCKNLQ CSQSSRAIID SQYNVGKYGS IVIGNDGLGV IAYMDLTNRT LKIAHCQNVE CSSATISTIT STLWVTFKTD IVIGANGFPA ILYAEDDAIA TLKYVQCHNI TCSVSTITTL DEEVYGSFAG KMIVGGDNKV IIAYNKLIHH LSVELKFKYC NNEACSDVTT TQIYINPADI FSYILDITIN SNNYPIISFY DIVQDHLLVA QCNTFNCSDY TINLVDSIGA ESGTMSSIMI GFDGLPLIVY HIRPIAGVHD NDDLRIAHCE NIACSSATIT TITAHLGTFL GYHPTIKKGA DNLGLFIYFD LINTNLRVVH CANILCTKEG QADFAQYLAL VQAPITTFSL LINQQTIPVQ PVAQQGQVFY QTTINVPSIV PTGGSFVLSA DPDGQTPSLV DDAVVFKLNN QEVFRFEYTG SIAPSPTLVT IPNSLVAQWA GQTLTVEFHD RFAGQIQAST MYLVWLP
|
| |