Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4885 |
Symbol | |
ID | 5736720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6220511 |
End bp | 6221593 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641282051 |
Product | glutamyl aminopeptidase |
Protein accession | YP_001547643 |
Protein GI | 159901396 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATGAAC GTTTAGCACT TATGAAGGCT TTGACCGATG CATCGGGCGT ACCAGGCAAC GAAGGGCCAG TTCGCGCTGT TATGGCCGAG GCCTTAGCTC CCTATGGCGA ACTCATTAAT GATCAGCTTG GCAGTGTCGC TGCTCGTAAA GCTGGCCCCG AAGGCAGCCC CACAATTTTA TTGGCGGGCC ACCTCGATGA AGTGGGCTTT ATGGTAACGC GGATCACCGA CGATGGTTTT ATTAAATTCC AAGCACTTGG TGGCTGGTGG GAATTGGTGA TGTTGGCACA ACGAGTGCAA ATTGAAACCC GTAATGGGCC AATCGCTGGC CTTATTGGCT CAAAGCCGCC GCATGTGCTC TCGCCCGAAG CACGCAAAAA ATTGGTCGAA AAGAAAGATA TGTTTATTGA TATTGGGGCG AGTTCAGCCG CTGAGGCCCG TGAATGGGGC GTGCGTCCAG GCGATTCAAT TCTGCCAGTG TGCCCGTTTA CCCCCTTACA CAACCCCAAA ATCGTCATGG CCAAGGCTTG GGATAATCGC TTTGGTTGTG CGGCAGCGGT TGAAGTGTTG CATGAGTTGG CCAATGAAAG CTTGCCTAAC ACGGTCGTGG CCGGTGCAAC TGTGCAAGAA GAAGTTGGCT TGCGTGGCGC AGCAACCCTT GCCAATGTGG TTAAGCCTGA TATCGCGTTT GCGATCGATG TGTGTATTGC AGGCGATACT CCAGGCATTA GTAAAGATGA AGCTCAAGCC AAAATGGGCG CTGGCCCAGT CTTGTTATTG ATGGATAGCA GCGTTATTCC AAACCCGCGG CTGCGCGACT TGGTGGTTGA TACCGCCGAA GAGTTGGGCA TTCCCTATCA ATTTGATACG ATGCCTGGTG GTGGTACCGA TGCAGGTCGC TTTCATCTAA ATAATGCTGG GGTTCCATCG TTGGCGCTGG GTGTGGCAAC CCGCTACATC CATACTCACG CTTCGTTGTT GCATCGCGAC GATTTTGATC AGGTGGTGAC CTTGCTGGCC GCCGTGGTAC GCAAGCTCGA TCAAGCCACT GTCGATTACA TCAAAACTGG GCAATCGGCC TAA
|
Protein sequence | MDERLALMKA LTDASGVPGN EGPVRAVMAE ALAPYGELIN DQLGSVAARK AGPEGSPTIL LAGHLDEVGF MVTRITDDGF IKFQALGGWW ELVMLAQRVQ IETRNGPIAG LIGSKPPHVL SPEARKKLVE KKDMFIDIGA SSAAEAREWG VRPGDSILPV CPFTPLHNPK IVMAKAWDNR FGCAAAVEVL HELANESLPN TVVAGATVQE EVGLRGAATL ANVVKPDIAF AIDVCIAGDT PGISKDEAQA KMGAGPVLLL MDSSVIPNPR LRDLVVDTAE ELGIPYQFDT MPGGGTDAGR FHLNNAGVPS LALGVATRYI HTHASLLHRD DFDQVVTLLA AVVRKLDQAT VDYIKTGQSA
|
| |