Gene Information |
Locus tag | Haur_0002 |
Symbol | |
ID | 5736836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1544 |
End bp | 3412 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641277123 |
Product | hypothetical protein |
Protein accession | YP_001542782 |
Protein GI | 159896535 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
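
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1544
end_bp = 3412
gene_length_bp = 1869
protein_length_aa = 622


def span_length(start: int, end: int) -> int:
    """Length of a closed, 1-based interval [start, end] (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming its last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print(computed_length == gene_length_bp)       # coordinates agree with Gene Length
print(computed_protein == protein_length_aa)   # Gene Length agrees with Protein Length
```

Under these assumptions, 3412 - 1544 + 1 = 1869 bp, and 1869 / 3 = 623 codons, of which one is the stop codon, leaving 622 residues.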
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000285538 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTTT TGGATCTACT TTTTGGGCGG CCACTGGCAA ATGAGGATGA GGAACATCAA CGAGTTGGTG TTGTAGCAGG GATTCCCATG TTAGGGTTAG ATGCGCTAGC CTCGGCAGCC TATGGCCCTG AGGCAGCTTT AACGATCTTA CTACCATTGG GTTTGCTGGG CATTAATGCC ATAACGCCGC TCGTTGCAAT TATCATCGTA TTACTGGGAA TCGTTTTTCT ATCCTATCGC CAAACAATCA CAGCCTATCC AAATGGTGGC GGCTCCTATA CGGTTGCCCA TGAAAATTTA GGAGTTATTC CTGGCCTCAT CGCCGCCGCC GCGCTTTTAC TCGATTATAT TCTTGTTGTA GCAGTTGGTA TTTCGGCTGG CGTAGGTGCA CTCGTTTCGG CAATTCCAAA ACTACAACCC TATATGCTCC CACTCTGTTT ATTAATTTTA GGGTTAATAA CGATTGTCAA CCTGCGAGGT GTTCGTGAAT CAGGGCTAGC ATTTGTGATC CCTACCTATC TATTCATTGC TTGTATGTTG ATTATCTTAG CCATGGGAGC TTATTTTGTA ATCATGAGTG GCGGTAAGCC AATCGCTAAA ATTGCTCCTG CGCCACAACC ATCAACCATG ACAACCTTGA GCTGGTGGCT CCTAATTCAA GCCTTTGCTA GTGGTTGTAC TGCCATGACC GGGGTTGAAG CGGTAAGTAA TGGGGTAAGT GCCTTCCGTC AACCAGCGAC CCATTATGCT CGCCGCACCT TAACAATTAT CATTGGAACA TTGATGGTGA TGCTTGCAGG CATTGCTTGG CTGGCTAAAT CGTATCAAAT TGGGGCGACC GAACCAGGCA AGGCTGGCTA TCAGAGCGTT TTATCGCAAC TTGTCGCTGC CGTGAGCGGG CAAGGCATCC TCTATACTCT AACAATTGGC TCTACTTTAG CAGTCCTCGC TTTGTCAGCC AATACAGGGT TTGCCGATTT CCCACGGCTC TGTCGGATTC TTGCCCACGA TCATTTTCTA CCCCATGCTT TTGCCTCACG CGGACGACGT TTAGTTTATA GCATCGGGAT TATAGTCCTA GCGAGTTTTG CTGGGATTAT CTTGATCATC TTTGGGGGGA TTACTGACCA TTTGATTCCA TTATTTGCGG TGGGAGCCTT TTTAGCATTC ACGCTTTCCC AAACCGGAAT GGTGCTGCAC TGGTTTAAGC ATGGTGGCCT CAACGCCCGA CGCAATATGT TGATTAATGG AGTCGGTGCA GTCTCAACCG GCATAACCTT GATCGTTATT TTAGTCGCAA AATTTGCGAC TGGTGCATGG ATTACCTTAG TCATCTTGCC AGCATTAGTT GGTTTATTTC TCGCGGTACG ACGACACTAT CAGCAAGTAG CCCAACAAGT ACAGCTGAAT ATTGCCTTAG ATACCAGCAA TTTAACCGCC CCGATTGTGG TTATTCCGTT TGGTGGTTGG AATAAAATGG CCCATAAGGC ACTACGCTTT GCATTAAAAA TATCACCTGA TATCTATGCT GTTCAGATAA GTACTGCTGA AGAAGCAGCA ACGAAACGAG AACAATGGGA ACAAATTGTA CTTAAGCCAA TTCAGGAGGC AGGGTTGGCT CAACCACATT TTGAATTAAT CGAATCGCCA TATCGGCAGT TGTTCGGGCC ATTAATGCGC TTTATTCTTG ATTTACGTGA GGCCAACCCT AACCGCCAAA TTGCTGTAAT TATCTCAGAA CTAGCTGAAA ATCGCTGGTA TTACTATTTA CTGCATAAAC AACGGGGAAT GGTTTTAAAA 
GCCCGTTTGT TTTTTGGCGG TAATGCCCAA ATTATTGTGA TTAATGTTCC TTGGTATTTG GAACATTAA
|
Protein sequence | MSVLDLLFGR PLANEDEEHQ RVGVVAGIPM LGLDALASAA YGPEAALTIL LPLGLLGINA ITPLVAIIIV LLGIVFLSYR QTITAYPNGG GSYTVAHENL GVIPGLIAAA ALLLDYILVV AVGISAGVGA LVSAIPKLQP YMLPLCLLIL GLITIVNLRG VRESGLAFVI PTYLFIACML IILAMGAYFV IMSGGKPIAK IAPAPQPSTM TTLSWWLLIQ AFASGCTAMT GVEAVSNGVS AFRQPATHYA RRTLTIIIGT LMVMLAGIAW LAKSYQIGAT EPGKAGYQSV LSQLVAAVSG QGILYTLTIG STLAVLALSA NTGFADFPRL CRILAHDHFL PHAFASRGRR LVYSIGIIVL ASFAGIILII FGGITDHLIP LFAVGAFLAF TLSQTGMVLH WFKHGGLNAR RNMLINGVGA VSTGITLIVI LVAKFATGAW ITLVILPALV GLFLAVRRHY QQVAQQVQLN IALDTSNLTA PIVVIPFGGW NKMAHKALRF ALKISPDIYA VQISTAEEAA TKREQWEQIV LKPIQEAGLA QPHFELIESP YRQLFGPLMR FILDLREANP NRQIAVIISE LAENRWYYYL LHKQRGMVLK ARLFFGGNAQ IIVINVPWYL EH
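
The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation sketch. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard amino-acid assignments, which translation table 11 shares with the standard code; table 11 differs only in its set of permitted start codons. The helper name `translate` is illustrative, not part of any database tool.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string
# (the same assignments are used by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final nine bases of the gene sequence.
first_block = translate("ATGAGCGTTT TGGATCTACT TTTTGGGCGG")
last_codons = translate("GAACATTAA")

print(first_block)  # MSVLDLLFGR
print(last_codons)  # EH
```

The first ten codons give MSVLDLLFGR, matching the first block of the protein sequence, and the final codons GAA CAT TAA give EH followed by a stop, matching its last two residues.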
|
| |