Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3653 |
Symbol | |
ID | 5735514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4593973 |
End bp | 4595502 |
Gene Length | 1530 bp |
Protein Length | 509 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641280802 |
Product | hypothetical protein |
Protein accession | YP_001546417 |
Protein GI | 159900170 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000579216 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCGAA CACGTTCCGT GATTGTTGGT TTGGTGGTAT TGCTGTTGGT AGCGCTTGGT GCTAGCGGTT GGCTCTGGTT TGAACGCGGC ACAATTATCT CGCGCGATGC CACGACCCAA GCTGAATTTG GCAAATTAAA AGATCAAATT ACTGCTGCTG ATGAGGTGCA AGATAAAAAC CGTCAGCTGC AAGACCAAGT GAATGATTTA CAAGAGCGTT TGAATAACCC ACCAACCAGC ATTGCTGAGC CGACCAGCCC AGCGCTTGAG CCAACGCCTG AAGCTGGCCC GACTCCGACG GTTGATCCGG CTGCCCCGAC CCCAACCGGC CCAGGTGGCG TGGAGCCGCC AGCACAAATT GTTGAAGTGA TGAAGCAAAT TGAGCAAGAA GTGATCGCGT TGCGTGGTTT GCCAGAAGAA CGCCCCGTCA CACGACGCAT GCTCACCCGC GATGAATTGC GTGATTATAT TGTGCGCGAA ATGGAAACCG AAAATACTCC CGAAGATTTT CGCCGTGAAA CCAGCCAATT GTGGATGCTT GGTTTAGCCG AAAAAGATAT TGATTTGCAA CAACTCTATA TTGATTTGCA AACTGAGCAA ATTGGTGGAT TTTATGATCC CGAAACTGAT ACCTTCTATA TTATTGCTGA AAACAGTGAA TTTCCACCAG CTGATCAAAT TACCTATGCC CATGAATTTA ATCATAACTT GCAAGATCAA TTGATTAATC TGCAAGATGG CCTGAAAGTT GGCGAATTTG ATGCTGATCG ATCTTTAGCA TTTCGTTCGT TGGTTGAAGG CGATGCAACC AAATTAATGA GCGATTGGTT GCAAAACGAT CTTATTCCAC GCATGTCACC TGCCGAGTTG CAAGAATTAT TGCGCACCTT GCAAGAACAA CAAGATAGCA GCAGCATTCT TGATCAAGTG CCTGGCGTGT TGCGCGATGG GCTAGTCTTT CCTTATGAAG ATGGTTTAGC GTTTGCTGAA GCAGTTTATG CCGAAGGTGG TTGGGAGGCA GTGACTAAGG CGTTGCAAGA CCCACCAACC TCGACCGAGC AAATTTTGCA CCCTGAAAAA TATTTGAGTG CCACCCGCGA TAACCCAACC CTGCCCGATC AATTTGATCT GTTGCCAGTG CTCGGTGCTG ATTGGACAAC CGCTATGACC AATACGGTTG GCGAGTTCGA TGTTAAAGCG TTGCTCGAAT ATACCGCGAC TGCTGGCGAT ATGGAAGCTG CGGCAGCAGG AATTGGCGGC GGTCGCATGA CCCTGTATGA ACACAACAGC GATTTCACGC CTGTGTTGCA ATGGACATTG CGCTGGGATA GCGCCGCCGA TGGTGATGAA TTTTTGAGCT TATTCAATGG TACGCTTAAC CCAAATGGCG ATTTGCTGCT ACGGGCTGGA GATCCAAACC GAAGTGATGA TGATGTCCAT GTTGGAGTCA AAGGCAGTGG CCAAGAATTT GTGATTATTT TTAGTTCGAA CCAAGATTTG GTGCGCAATG CCTTGAATGC CTTACCCTAA
|
Protein sequence | MNRTRSVIVG LVVLLLVALG ASGWLWFERG TIISRDATTQ AEFGKLKDQI TAADEVQDKN RQLQDQVNDL QERLNNPPTS IAEPTSPALE PTPEAGPTPT VDPAAPTPTG PGGVEPPAQI VEVMKQIEQE VIALRGLPEE RPVTRRMLTR DELRDYIVRE METENTPEDF RRETSQLWML GLAEKDIDLQ QLYIDLQTEQ IGGFYDPETD TFYIIAENSE FPPADQITYA HEFNHNLQDQ LINLQDGLKV GEFDADRSLA FRSLVEGDAT KLMSDWLQND LIPRMSPAEL QELLRTLQEQ QDSSSILDQV PGVLRDGLVF PYEDGLAFAE AVYAEGGWEA VTKALQDPPT STEQILHPEK YLSATRDNPT LPDQFDLLPV LGADWTTAMT NTVGEFDVKA LLEYTATAGD MEAAAAGIGG GRMTLYEHNS DFTPVLQWTL RWDSAADGDE FLSLFNGTLN PNGDLLLRAG DPNRSDDDVH VGVKGSGQEF VIIFSSNQDL VRNALNALP
|
| |