Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4080 |
Symbol | |
ID | 5735939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5212417 |
End bp | 5213433 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281232 |
Product | hypothetical protein |
Protein accession | YP_001546840 |
Protein GI | 159900593 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGACG ATCCGTTAAC CTCAAACCAA GCAGAATTAG CAGAGGCCGA GCCACGTCGG AGTAGCTCGG CCCCACCTCA ATTACGTGCG GCGATCAGTC GGGCATTATC ACGGCGCAAA GCTGAATTGC AAAGCCAAAG CGTTGCTGAG GAAACGGTAG CTGTTGAAAT TGCGCCAACT ATCGTTGCCG AGCCAGAGCC GCCGATGCCG CTACGGCAAG AGCGTTATCG GGTGGCGGCA AGTCGCGCAT TAAGTCGTTT GCGGCCAAGC AGTGAAACTC CACAGATTGC TAATTCAACC AGCAACCGAG GCCGTGATCG CACTCGTACT CGGTTGCGCC CGCCAACCTT GCCTAACCAA CCTTTGCCCA AACTGCCAAC CGATCAACCG GTGCGGCGCG AATTGCCAGC CGCAGTTGTC GCAGCCTTGC AGCCCGATAG CCTGCCCGCA CCAGAAATTC AGCGCCCGCC ATTGTGGGTG GCCTTCGTGG CAGGCTTAGG TTGGACAGCG ATGTTGCTAT GGGTCTTTTT TGGCGGGCTT TCGCCGCAAT CGGGCTTTGG AGTGCGTAGT TTGTTTGGCC TGGCAACTGT GGCGTTGGGC TTAACCACGT GGCTGCCAAT GCAATGGGCC TTGCGTTTGC CAGCATTAAC GTGGCGTGGC ACGGTTGGTT GGGCCATCAT TTTCTGGATC TTGGCTTTTG TGCCCGCACC AACTACCACG ATTGCCGCAG GATTGCCCGA TTTGCCGATC TACTTGATCT TTCTGAGTGG GGTATTTTTG GCTAGCAATG CCATGGTTTT GCCAATCGTC TATGTCTGGG GCATTCGGCG TTATCATAAT CGGGCGCAGC GCTTTGATGT GCGGCGTTCG CATCGCCAAG CAGCCGAGGC TGGAATTTTT TGCAGTTTGT GTGTGCTTTT GGCAGCAATG AACATTTTCG AGCCAGTTAC AGTGTTATTG TTAATCGCCA TTTTGGTATT GAGCGAAATG GTGATTTTAT CGTTACAGGC CAAATAG
|
Protein sequence | MTDDPLTSNQ AELAEAEPRR SSSAPPQLRA AISRALSRRK AELQSQSVAE ETVAVEIAPT IVAEPEPPMP LRQERYRVAA SRALSRLRPS SETPQIANST SNRGRDRTRT RLRPPTLPNQ PLPKLPTDQP VRRELPAAVV AALQPDSLPA PEIQRPPLWV AFVAGLGWTA MLLWVFFGGL SPQSGFGVRS LFGLATVALG LTTWLPMQWA LRLPALTWRG TVGWAIIFWI LAFVPAPTTT IAAGLPDLPI YLIFLSGVFL ASNAMVLPIV YVWGIRRYHN RAQRFDVRRS HRQAAEAGIF CSLCVLLAAM NIFEPVTVLL LIAILVLSEM VILSLQAK
|
| |