Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3117 |
Symbol | |
ID | 5734989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3932678 |
End bp | 3934960 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641280261 |
Product | hypothetical protein |
Protein accession | YP_001545883 |
Protein GI | 159899636 |
COG category | [S] Function unknown |
COG ID | [COG1300] Uncharacterized membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.247741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAACC AAGCCTTAAC CATTACACGC CGCGACTTAC GCGATACGCT CAGCGATTGG CGCACATTGA TTCCGTTATG CATTCTTTCG GTGTTGTTGC CGCTAATTCT GCGGGCGGGG GTAGCGCAAG CAGTGAGTTT TCTCGACGAT GAATTTATCA ACCGCAGTTT AGTGCCATTA GGTCTGTTGA TTGCGGGATT CTTGCCAGCC TCACTCTCCT TGATTGGCCC CTTGGAATCG TTTGTCGGCG AACGTGAACG CTCAACCCTG GAATCGTTGT TGGCCATGCC AATGAGCGAT CGTGGACTCT ATCTAGCCAA GTTTTTGGCG GCACTATTGC CACCAATTGG CTCCTCGGCA GTGGCGATGC TGATGTATAG CCTAGCGATT GAGCTTGGGC GACCAGTACG CGTAGCCTTT GTGCAAAAGC TGCTCAGCGA CTATTTAACC AGCGGCTGGA TTATTGGCAT CACATGTATT TTGTTGATTA AAGTTTTTAC CATGGTTGCG GCAGCTGTCT ATGTTTCGTC ACATACTACC AGCATTCGGG CAGCCAACTT ATTAGCCAGT TTCATCCTAA TTCCAATGGC AATTTTGGTC CAGATCGAAG CACTGATCAT CATCAACGGC ATGTTTTTGC CAATTGTGTT GATTAATGGT TTGTTGCTCG TGGTTGGTCT GACCATGGTT GGTTGGGGCA TGTATAGCTT TAATCGCGAA GAATTGCTCT CACGTGAGCA TGAAAGCATC AGCAAACGAG CCTCAACCCA ACGCCTGAGC CAATCGACCC GCACCTATGG CCCTGTGATG ACCATTGTAC AGCGCGAGGC TGTTGACACG CTCAGCGACT GGCGAATTTT GGTGCCAATT GGCCTCTTGA CCTTTTTAGT GCCAATTGGC GCTTTGTTTG GGGTCATTTA TGCTTTTGCT AATGTCGATG ACCCGACTGG GGTGGTTAAT CTATTGCCCT TTATCGTGTT GCTGGTGGGC TTCTTACCAG CCTCATTTTC GCTGATTGTG GCACTTGAAG TTTTTGTTGG TGAGCGCGAA CGTGGGTCGC TGGAGTCATT ATTCTCAATG CCGATCAGCG ATGGTCAGTT GTATCGAGGA AAGTTATTGG CGGCGATCGT GCCACCAATT GGCGCAAGCT TGGTTGGCAT GCTGTTGTTT GGGGTTGGAC TGAGCGTCTT TGCCCCAACT GCCTTGCTGG GACGGATCAA CTTAAGCATT TTTAGCCAAA TGGTCTTGCT GAGTATTGCC CAAGCCTTGA CGATGGTTTC AGCAGCGGTG GTGATTTCGT CGCATACCAA CACCGTGCGC TTAGCCAATT TATTGGCAAG CTTTATTTTG ATCCCAGTGG CGATTATGGT ACAGCTTGAA GCGGTGCTGA TTATCGGCGA ACGCTTCGAC GTGCTGAATG CGATAATGGC AGTAATGTTA ATTTTGACAA TCGTCTTGAC GCGTACCGGC ATTGGTAGCT TTAAGCGTGA ATCAATTCTA TCGCGTGAGC ATTTAGCCCT CAATTTTAGT CAAATCAATC GAGCCTTCAA AGCCTTTTTC AGCGAAATTC GCCCAGCAGG CAGCAACCCC GATAGCCATC TCGGCTTATT CAATGTTGAA ACTACTAATG GCCCACAACG CAATCTAGGA CTGTGGCTCA AACGCTTCTA TCGCCAAGAA TTAGCTGTCG TTTGGCGCGA GACCCGACTA GCGCTGGTTG TGGTGCTCCT CTTTTGTGGC GCAGCAATCC TTTTTGGTAG CCAATTTAAT CCAGTCAGTA GCCGCGAACA ATCATTGGCC AATGTTATGA CGCGCTTAGA CGTAGGTCAG GGCACACTCT GGGAGCCGCC ATTAAGCTTT ATTGTCATTA GCAATAGCTT TGCAATCTTT TTTGCTGGGC TATTTTCAAG CATCACTTTA GGCTTTTTTG GCTTGATTCT GCCAGCCATT AACCTAATGG GATTGAGTTT TCGAACCAGT AGTTTAGCCG CAAGCGGCGG CCTAGCAACT GCTCTCAATT ATCTAGTGGG CTATGAATTG CCGCATGGGT TATTGGAAAT TCCGCTAAGT ATATTTGCGG CAGCCTTGGC ATTACGTATG GGTGCGGCTT TGGCGTTTGT GCCACCAACC TATAGCGCTG GGCGGCATTT GCTCTGGGCT TGGGCCATGT ATCTCAAAGT ATTTTGCTTG CTAATTGTGC CTGGACTCGT GCTGGCGGGC TTGATTGAAG TGTTGGTAAC CCCAGCTGTG CATCAGATGG TCTATGGGTT TCTGTGGATT TAG
|
Protein sequence | MRNQALTITR RDLRDTLSDW RTLIPLCILS VLLPLILRAG VAQAVSFLDD EFINRSLVPL GLLIAGFLPA SLSLIGPLES FVGERERSTL ESLLAMPMSD RGLYLAKFLA ALLPPIGSSA VAMLMYSLAI ELGRPVRVAF VQKLLSDYLT SGWIIGITCI LLIKVFTMVA AAVYVSSHTT SIRAANLLAS FILIPMAILV QIEALIIING MFLPIVLING LLLVVGLTMV GWGMYSFNRE ELLSREHESI SKRASTQRLS QSTRTYGPVM TIVQREAVDT LSDWRILVPI GLLTFLVPIG ALFGVIYAFA NVDDPTGVVN LLPFIVLLVG FLPASFSLIV ALEVFVGERE RGSLESLFSM PISDGQLYRG KLLAAIVPPI GASLVGMLLF GVGLSVFAPT ALLGRINLSI FSQMVLLSIA QALTMVSAAV VISSHTNTVR LANLLASFIL IPVAIMVQLE AVLIIGERFD VLNAIMAVML ILTIVLTRTG IGSFKRESIL SREHLALNFS QINRAFKAFF SEIRPAGSNP DSHLGLFNVE TTNGPQRNLG LWLKRFYRQE LAVVWRETRL ALVVVLLFCG AAILFGSQFN PVSSREQSLA NVMTRLDVGQ GTLWEPPLSF IVISNSFAIF FAGLFSSITL GFFGLILPAI NLMGLSFRTS SLAASGGLAT ALNYLVGYEL PHGLLEIPLS IFAAALALRM GAALAFVPPT YSAGRHLLWA WAMYLKVFCL LIVPGLVLAG LIEVLVTPAV HQMVYGFLWI
|
| |