Gene Haur_4885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4885 
Symbol 
ID5736720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6220511 
End bp6221593 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content53% 
IMG OID641282051 
Productglutamyl aminopeptidase 
Protein accessionYP_001547643 
Protein GI159901396 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATGAAC GTTTAGCACT TATGAAGGCT TTGACCGATG CATCGGGCGT ACCAGGCAAC 
GAAGGGCCAG TTCGCGCTGT TATGGCCGAG GCCTTAGCTC CCTATGGCGA ACTCATTAAT
GATCAGCTTG GCAGTGTCGC TGCTCGTAAA GCTGGCCCCG AAGGCAGCCC CACAATTTTA
TTGGCGGGCC ACCTCGATGA AGTGGGCTTT ATGGTAACGC GGATCACCGA CGATGGTTTT
ATTAAATTCC AAGCACTTGG TGGCTGGTGG GAATTGGTGA TGTTGGCACA ACGAGTGCAA
ATTGAAACCC GTAATGGGCC AATCGCTGGC CTTATTGGCT CAAAGCCGCC GCATGTGCTC
TCGCCCGAAG CACGCAAAAA ATTGGTCGAA AAGAAAGATA TGTTTATTGA TATTGGGGCG
AGTTCAGCCG CTGAGGCCCG TGAATGGGGC GTGCGTCCAG GCGATTCAAT TCTGCCAGTG
TGCCCGTTTA CCCCCTTACA CAACCCCAAA ATCGTCATGG CCAAGGCTTG GGATAATCGC
TTTGGTTGTG CGGCAGCGGT TGAAGTGTTG CATGAGTTGG CCAATGAAAG CTTGCCTAAC
ACGGTCGTGG CCGGTGCAAC TGTGCAAGAA GAAGTTGGCT TGCGTGGCGC AGCAACCCTT
GCCAATGTGG TTAAGCCTGA TATCGCGTTT GCGATCGATG TGTGTATTGC AGGCGATACT
CCAGGCATTA GTAAAGATGA AGCTCAAGCC AAAATGGGCG CTGGCCCAGT CTTGTTATTG
ATGGATAGCA GCGTTATTCC AAACCCGCGG CTGCGCGACT TGGTGGTTGA TACCGCCGAA
GAGTTGGGCA TTCCCTATCA ATTTGATACG ATGCCTGGTG GTGGTACCGA TGCAGGTCGC
TTTCATCTAA ATAATGCTGG GGTTCCATCG TTGGCGCTGG GTGTGGCAAC CCGCTACATC
CATACTCACG CTTCGTTGTT GCATCGCGAC GATTTTGATC AGGTGGTGAC CTTGCTGGCC
GCCGTGGTAC GCAAGCTCGA TCAAGCCACT GTCGATTACA TCAAAACTGG GCAATCGGCC
TAA
 
Protein sequence
MDERLALMKA LTDASGVPGN EGPVRAVMAE ALAPYGELIN DQLGSVAARK AGPEGSPTIL 
LAGHLDEVGF MVTRITDDGF IKFQALGGWW ELVMLAQRVQ IETRNGPIAG LIGSKPPHVL
SPEARKKLVE KKDMFIDIGA SSAAEAREWG VRPGDSILPV CPFTPLHNPK IVMAKAWDNR
FGCAAAVEVL HELANESLPN TVVAGATVQE EVGLRGAATL ANVVKPDIAF AIDVCIAGDT
PGISKDEAQA KMGAGPVLLL MDSSVIPNPR LRDLVVDTAE ELGIPYQFDT MPGGGTDAGR
FHLNNAGVPS LALGVATRYI HTHASLLHRD DFDQVVTLLA AVVRKLDQAT VDYIKTGQSA