Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4490 |
Symbol | |
ID | 5736341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5750015 |
End bp | 5751271 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281653 |
Product | dipeptidyl aminopeptidase/acylaminoacyl-peptidase-like |
Protein accession | YP_001547250 |
Protein GI | 159901003 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAATGGT TCAAGCGTTC TTGGCGCAAC ATTCTAGGAA TTATGGTTGG CCTTTTGGCA ATTGGTTTGG TCTGGCTGCT CACCACTGAT AACGTGACGA TCTGGCCGAT TCGCAATACG CTGCGTTATC AATTCGACCA ATGGCGAACC GACAGCCAAA TGCCAAGCCA GCCAATCGCT GATCGCAGCA TCGTTGGGTG TATTACCAAT GCGCAGCAGC AACCAATCGT TGGGGCAATT GTGGCTGTCA GCGAACGCAA CGGCAGATTA CATCGAGCTA TCAGCGATCG TCAAGGCTGT TATCGGCTTG GCAATGTGCC AGCTAACCAA TATCGCTTAT TGGTGACTGC GCCGAGTTAT CGCGATGATT TGATCGATGT TGATGTGCAG CAAGCCCAAA CTGAGCAGCA TGCACAACTT TTGCCGGCAA TTGCCCCAAG TTATGCCCCA GTCGAAAAAC TCGTGCTTGG CCCAAGCAAT GTAGTTAGCC GAACTACGCC CTACCCTACC CAAGCATTGC GCCAGCAGGT GCAAGTTTGG AGCGATAACG GCGAGCAGCA ATTGACCCTG CTCTATCGAC CAATTACCGC CACGCAACCG TTGCCGTTGA TGTTGGCGGT CTACCCTGGC CCTGCCAACG AATGGGAGAG CGTGAGCATT CCCTTGGCCG AGCGCGGTTA TAGCGTGTTG GCGGTTGGTC CAGCCTACAG CCTCGACCTC GAAACTGATA TTGCCGATCT CAAGCGCTTG TTGGCGTTGG CGCGGGGTGG CTCGTTCGTG GGAGTTGATG GCAGCCACAT TGCGATTATG GCAGGCAGTT ATAGCAGCCT GCACGTTTTG CGCCTGTTGC AAGACGATGT AGGTTTTACG GGGGTGGTAT TGTTGGGGCC AATTAGCGAT TTGTTTGCCA TGCGCGAGAG CTTTGTGGCC GGAACATTCA TGCCGCCGTT TGGGCTTGAT CAAGCCCTGA TTGCCCTAGG TTATCCCGAC GAGGAGATTC AGCGCTATGC CAGCTATTCG GCTCAATTAC ACCCTCGCGC TGATTTGCCG CCAATTTTGT TGATGCACAG TCGGAATGAT GAAGTTGTGC CCGCCAGCCA ATCAGAATTT TTGGCTGAGC AATGGCGGGG CTTGGGCGTT GAGGTTGAAA GTTATTTTTT CGATGGCATG TCGCATTATC TGCGGGCGGT CGAACCTTCA CCAGAGCTTG ATGAGTTGTA TCGCATAACT TTAGATTTTC TGGCACGAGT TAATTAG
|
Protein sequence | MQWFKRSWRN ILGIMVGLLA IGLVWLLTTD NVTIWPIRNT LRYQFDQWRT DSQMPSQPIA DRSIVGCITN AQQQPIVGAI VAVSERNGRL HRAISDRQGC YRLGNVPANQ YRLLVTAPSY RDDLIDVDVQ QAQTEQHAQL LPAIAPSYAP VEKLVLGPSN VVSRTTPYPT QALRQQVQVW SDNGEQQLTL LYRPITATQP LPLMLAVYPG PANEWESVSI PLAERGYSVL AVGPAYSLDL ETDIADLKRL LALARGGSFV GVDGSHIAIM AGSYSSLHVL RLLQDDVGFT GVVLLGPISD LFAMRESFVA GTFMPPFGLD QALIALGYPD EEIQRYASYS AQLHPRADLP PILLMHSRND EVVPASQSEF LAEQWRGLGV EVESYFFDGM SHYLRAVEPS PELDELYRIT LDFLARVN
|
| |