Gene Information |
Locus tag | Haur_2856 |
Symbol | |
ID | 5736893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3625402 |
End bp | 3627084 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641279999 |
Product | hypothetical protein |
Protein accession | YP_001545622 |
Protein GI | 159899375 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00133424 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACCAA CCAACTGGCA GCGCGAGGTT GAACAAGCCT TAGCTCAAAA CGACCACAAT CGAGCCAAAC AGCTTTTAGC CGCCGTGATT CGCCAAAATG TGCATGAACG CGAGGCCTGG CGCATTTTAG CGAGCATCGT CAATGATCCA GCCCAACAGG CCGAGTGTTT GCGCCGAATT GCCGCAATTG ATGCCGCCAC GCCAACTGTG ATTAAGCCAA TTAAGCCAAT TGCTGCCGTA GCACCAAGCA CTGAGCTACC AAGCGATCCA AGCACTAGCC CAACCACGCC CTTGGTTGCT GTACAGGATA CGCCTAATTT TACGCCATCA AGCATTCAAA CTGAGCCATT ATTTGATTCA CAAGCTAGTT TCACGCTGCC AGCCAATTAC AGTATGGGCC AAGCAACCCA GAAACTGGAT CAACCAATGC CGCAACCTGC TCGTTCGTTG CCTAAAATGT TAATGTTGAT TGGAGCAAGT TTGTGTGGTT TGGGCTTGTT GCTTTTGCTT GGTTTACAAT TTTGGCCATT GTTAAATGAT ATCAACTCAA CGCCCGCTAG CGAACAACCG CTGCTAGCCG TGGTTGAACT CACGATTGTA CGATCGGGCG TTGGTCAAGT GCCGATGACC GATCAGGCCA TGGTCTATTT TGAAATTGAA AATCCCAGTG ATCAAGCACT GTATAACATT CCCTATACCA TGACCTTGAC CAGCCAATTT GATAAAGTGC TGAAAAATAC CAATACGATT GAATTATTGC TGCCCAAGCA AAAAATCGTG GTTGTTGATG GGATTTTTAC TGGGGATAGC TTTCGCGCCA AAAGCGTCGA AATTAGCCTA GGCCAGGGCT ATCCAATCAC AATTGATCAA CCAATTCCAC AAGTTGAGGC GCTTACTCCA ACCACGTTTG GCATGCCCTT TCGTGGGCGT TTGCAGGAAA GTTGGGGCTA CGAATACCCA ACTTATCATG TTAGCGCGAT TGCAACCGAT AGTGGTGTGG TTACTTCAAC CTTGGTATCA ATGCTGTTTT ACGATACGAA CGATCAAATT ATTGGCGCTG GTTCGGGCTA TATCGATTAC ATTAATAAAA AAACTTCATT GCCAAATACT CGCCGTGGTG AAACGATGGT CTATCCCATT ATTTGGGGCA CGAGTGAGGT CGATCGGGTT GAAATTGTGC CTGCATTTAC TTTAGCAACA TTTAGTCAAC AACCCCCTAT TAAGCCTAAA AAGGCTGAGA TCGATCAAAT GTGGCTAGGT GAATATATTC CAGAGGTTGA TCCACCATAT CATGTACAAG GTATTGTTGA ATATCCAACA TTACCGCCAA CCTCAGGCTA TCATGCGATG GTGCCAGCAG AATGGGGGCA TTACGATAAT TTTATTACTG ATGAGGCAAT GGTACATAAC TTAGAGCATG GTGCGGTGAT TATTTTTTAT AACCCACAAT TAATTACTGC AAATGAGCTT GGCCAATTAG AAGCAACTTT TGAAAATCTT TATCAACGCG AACATCATAC AATTTTGCAA AAACGCTTCG ATTTGGATGC AACTGTGGCA ATGACTGCGT GGGAATATCG TTTGATGCTG CGCGATAATG TTAATCTCGA TGCAGTCAAC ACGTTTTTCA GTGAACATAT TGCCCGTGGC CCTGAATGCG TTAATTTACG CTGTCCTAAT TAA
|
Protein sequence | MEPTNWQREV EQALAQNDHN RAKQLLAAVI RQNVHEREAW RILASIVNDP AQQAECLRRI AAIDAATPTV IKPIKPIAAV APSTELPSDP STSPTTPLVA VQDTPNFTPS SIQTEPLFDS QASFTLPANY SMGQATQKLD QPMPQPARSL PKMLMLIGAS LCGLGLLLLL GLQFWPLLND INSTPASEQP LLAVVELTIV RSGVGQVPMT DQAMVYFEIE NPSDQALYNI PYTMTLTSQF DKVLKNTNTI ELLLPKQKIV VVDGIFTGDS FRAKSVEISL GQGYPITIDQ PIPQVEALTP TTFGMPFRGR LQESWGYEYP TYHVSAIATD SGVVTSTLVS MLFYDTNDQI IGAGSGYIDY INKKTSLPNT RRGETMVYPI IWGTSEVDRV EIVPAFTLAT FSQQPPIKPK KAEIDQMWLG EYIPEVDPPY HVQGIVEYPT LPPTSGYHAM VPAEWGHYDN FITDEAMVHN LEHGAVIIFY NPQLITANEL GQLEATFENL YQREHHTILQ KRFDLDATVA MTAWEYRLML RDNVNLDAVN TFFSEHIARG PECVNLRCPN
|
| |
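The coordinate and length fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: it assumes the start and end coordinates are 1-based and inclusive, and that the reported gene length includes a single terminal stop codon (the gene sequence above ends in TAA and the record marks the gene as unspliced). The function names and the `RECORD` dictionary are hypothetical and introduced only for illustration.

```python
# Values copied from the Haur_2856 record above.
RECORD = {
    "start_bp": 3625402,
    "end_bp": 3627084,
    "gene_length_bp": 1683,
    "protein_length_aa": 560,
}


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coded_residues(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one stop codon is
    included in the gene length and is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def is_consistent(record: dict) -> bool:
    """True when the coordinates, gene length, and protein length agree."""
    length_ok = (
        span_length(record["start_bp"], record["end_bp"])
        == record["gene_length_bp"]
    )
    protein_ok = (
        coded_residues(record["gene_length_bp"])
        == record["protein_length_aa"]
    )
    return length_ok and protein_ok


if __name__ == "__main__":
    print(is_consistent(RECORD))
```

Under these assumptions, 3627084 - 3625402 + 1 gives 1683 bp, and 1683 / 3 - 1 gives 560 aa, matching the reported values. The strand field ("-") does not affect these checks; it only indicates that the coding sequence lies on the reverse strand of the replicon.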