Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1961 |
Symbol | |
ID | 5733850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2395924 |
End bp | 2396925 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641279105 |
Product | hypothetical protein |
Protein accession | YP_001544732 |
Protein GI | 159898485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.215291 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCAAC CAATTAATCA ATTACCCTGG GAGCATGCCG ATTGGTTCGA CCGTTTGAGC ACATGGATTG AGCAGCAGTT AGCGAGCCAT CAACGCAGCA TGCTTGAGCC AATTCAACTC GTACATCAAC GCCCATGGTC GGCCTTTGCC CAAATCCAGA CTAATCAAGG TATGGTCTAT TGCAAAGCGC CAGCTCCAGC CTTCCACTAT GAAGCGGCCT TGACCCAAGC CTTGGCCGAT TGGCAACCTG ATTGCAATGT GCCAGTTTTA GCGATCGAAC CAACCCAGGC TTGGATTCTC TCAGCCGATG CTGGCACAAC CTTACGTCAA CTTGGCCAAA ATCTGACCCA GCTTGAGCAT TGGTATGCGT TGTTGCCACA GTACAGCGAG CTACAAATAA ACCTCGCTCA ACGGGTTCCA GCTTTGTTAG CACTCGGCGT TCCCGATCGG CGATTAAGCC AGTTCCCAAC GTTGTTTCGC GAACTCTTGA ACGATCGCCA GCATTTATTG ATTGATCAAG AACTTGGCTT GAGTAGCAGC GAATATCAAC AATTGCAAGC ATTAGCGCCA ATGGTACAAG CACAGGCTGC CCAATTAGCC GAATTTGGCT TGCCAGAAAC GCTGACCCAC GAAGAAATTC ATGAAAATAA TGTGCTTTAC GGCGAGCGCG GCTATACCTT TACCGATTGG AGCGATTGCA GCGTGAGCCA TCCCTTTTTT TCGCTGCTAG TCACGTTGCG AGCCGCTGCC CATTGGCTCA AGCTCGATGA GCATGGCCCC GAGTTACAAC GGCTGCGTGC TGCCTATTTG GAGCCATGGA CACGCTTTGC GCCGCGCTCG CAGCTTGATC AAGCGGTGGA GATTGCCTAT CGGCTTGGGA TGATCAATCG GGCGCTTTCG TGGCGGCAAG CCTTGGACGG CCTCGATCCA GCTCAAACGC AAGAGTATCA AGATAATGTC GCAGGCTGGC TCCAAGATTA TTTAACAGCC AATACGGCTT AA
|
Protein sequence | MSQPINQLPW EHADWFDRLS TWIEQQLASH QRSMLEPIQL VHQRPWSAFA QIQTNQGMVY CKAPAPAFHY EAALTQALAD WQPDCNVPVL AIEPTQAWIL SADAGTTLRQ LGQNLTQLEH WYALLPQYSE LQINLAQRVP ALLALGVPDR RLSQFPTLFR ELLNDRQHLL IDQELGLSSS EYQQLQALAP MVQAQAAQLA EFGLPETLTH EEIHENNVLY GERGYTFTDW SDCSVSHPFF SLLVTLRAAA HWLKLDEHGP ELQRLRAAYL EPWTRFAPRS QLDQAVEIAY RLGMINRALS WRQALDGLDP AQTQEYQDNV AGWLQDYLTA NTA
|
| |