Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1461 |
Symbol | |
ID | 5733346 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1703461 |
End bp | 1704681 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278599 |
Product | hypothetical protein |
Protein accession | YP_001544233 |
Protein GI | 159897986 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000257739 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGGAG TGCCAGCAAT TGATCCACGG CTCAATCTGA CCCAACGGGG CGGGGTTATC ACGATTGGCA ATACGCTAGG CCATGATTGT GCGGCGGGCG TTCCCGCTCC AATAGTGGGT GCAGTTGGTA CGTGTGGTAC AAATACGAGC GATTCAGGCG TTGACATTTA TTGGCGAGCC GATTCGCCTG CTGTCAGCCA AGCTGAAGCG AATACCAGCC TCAATAATGC CCAGGCTCGC AGTAGTGCAA TGCTCAATAT TCCGGTCGGG GCCAGCGTAA CCCATGCCTA CCTCTATTGG GGAGCGCGTT GGGGTGGCGT AACACCAACT GGTGCTACGC TTGAATTTAA TTGGGGCAAT AGCCAGAACG TCACACCCAT CAGTACAGTT ATCAATCCAA ATTCATTTTA TCAAGCGGTC GCAGATGTTA CCAGCTATGT TCAAACGCAT GGCTCCGGAG CCTATCGTGC CAGTGGCATT CTCGCTAATC CGTTTATTAA CACCAATGAT ACTAACGCCT TTGCCGCTTG GTGGTTGGTA GTGGTCTATG CTGATGCAAG CCAACCCAAT CGAACGATCA TTCTCCACGA TGGGCTGGAT CAAGTATTAA TTGGCAGCTC AATTACCAAT ACATTGAGTG GATTCAATAT TCCCAGCAGC AGCACAACCA ACCTTACGAT TGTGGGCTAC GAGGGAGATA ACTCAAGCGC CGGTGACCAA TTGCTGTTTA ATGGTAATAG CGTTAGCAAT GCGCAAAACC CTACCGATAA TATGATGAAT AGCACTCGTT CATATTTAGG CAACCCAGTT TCAAGCGCAG GCGATTTGCC ACAACTAACC GGAGCAGCCA ACAGTATGGG TGGGTTTGAC CTTGATACCT TTGATGTTTC AGCATGGACT AGCCCAGGCC AAACCACAGC TAGCATAACC ACTGGCTCCA CTAGCGATAT GTATTTCGTT GGGGGCTTGA TTTTAGCGAT CACTTCGTTG GTTGCCGATA CTCCAACGCC CAGTAACACC CCAACCAACA CCCCGACGAA TACTTCTACT GCAACGAACA CGCCGACCAA CACACCAACG AATACCGCAA CCAACACGCC GACCAATACC GCGACCGCAA CGAGTAGCCC AACCAATACT CCAACCAGCA CAGCGACCAA TACTCCCACA CCAAGCCTAT GGCGAACCTT CTTGCCAATC GCAATTCGCT CAGCTGAATA A
|
Protein sequence | MEGVPAIDPR LNLTQRGGVI TIGNTLGHDC AAGVPAPIVG AVGTCGTNTS DSGVDIYWRA DSPAVSQAEA NTSLNNAQAR SSAMLNIPVG ASVTHAYLYW GARWGGVTPT GATLEFNWGN SQNVTPISTV INPNSFYQAV ADVTSYVQTH GSGAYRASGI LANPFINTND TNAFAAWWLV VVYADASQPN RTIILHDGLD QVLIGSSITN TLSGFNIPSS STTNLTIVGY EGDNSSAGDQ LLFNGNSVSN AQNPTDNMMN STRSYLGNPV SSAGDLPQLT GAANSMGGFD LDTFDVSAWT SPGQTTASIT TGSTSDMYFV GGLILAITSL VADTPTPSNT PTNTPTNTST ATNTPTNTPT NTATNTPTNT ATATSSPTNT PTSTATNTPT PSLWRTFLPI AIRSAE
|
| |