Gene Haur_2751 details

Gene Information

Locus tag: Haur_2751
Symbol:
ID: 5734632
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3508481
End bp: 3509581
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 55%
IMG OID: 641279894
Product: peptidase M24
Protein accession: YP_001545517
Protein GI: 159899270
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:
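
The coordinates and lengths listed above are mutually consistent, and the check can be sketched in a few lines of Python. This is a minimal illustration, not part of the record: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3508481
end_bp = 3509581


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # 1101 366
```

Under those assumptions the result matches the listed Gene Length (1101 bp) and Protein Length (366 aa).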


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTGGCG TGCGAATTCG ACGTTTGGCT AGCGCTGCCC GCCCGCAAGG GATGGATTAT 
GTGGTGTTGA TGCCTGGGGC TAACTTACAA TATTTTACGG GCTTGACCTT GCATTTAAGT
GAGCGTTTGG CCTTGGCGTT GATCGCTGCT GATGGTCAGA GCATCAATAT TGTGCTGCCA
GCCTTGGAGC AACCGCGTGC TTTAGCCGAA TATAGCGGCG AAGTGGCGGT ACGTTGGTTT
CCATGGAGCG ATGATGAAGG CCCAATGAAT GCTTTGCGCA ATGCAGCGGC AGGCCTGATT
GGTCGCACAG TTGGCGTGGA ATATACGACG ATGCGGGTGC TAGAATTACG CGCTTTAGAA
GAAGTTGCGG GCGTACATAG CATCGATGCC AGCGCCGCGA TCGCCAGTTT GCGCATGCAA
AAGGGCGCTG ATGAAATTGC CCTGATGCGC GAAGCTGTGC GCATTGTTGA GGCTGGGCTT
AAAACCGCAA TTGAGGCGCT TCATCCAGGC CGAACCGAGC GCGAAATTGC CCGCATTTGG
GAAGAAGCGA TGCAACTTGA GGGTGGCGAA GGCCCATCAT TTGCGACGAT TGTGGCGAGT
GGCCCAAATA GTGCTAATCC ACACCATACG ACGGGCGAGC GCCAAATCCA AACTGGCGAT
TTGGTAATTT TGGATGGTGG GGCGTTGTAT CGCGGCTATT GCTCGGATAT TACCCGCACT
GTTTGCGTTG GCGAGCCAAA CGAGCAACAA CGGATGCTCT ATGAAACCGT TTTGGCGGCC
AATCGCGCTG CCTGTGCCGG AGCCAAACCA GGCATGAGCG GCGCACAGGT TGATCGGCTC
GCACGGCAAG TGGTTGAGGA TGCCGAATTA GGCCGTTACT TCATCCATCG CACAGGCCAT
GGCTTGGGTA TGGAAATTCA CGAGCCGCCC TATATCGCTA GCACCAACAC CGTTGCCCTG
CCAATTGGCA CGGTTTTTAC GGTTGAGCCA GGCACCTATG TTGCTGGAAT TGGTGGCGTG
CGGATTGAAG ATGATGTGCT GTTGACCCCC ACTGGCGCTG AATGTTTGAC CAACTTTCCA
CGGGAGTTGA TTGTCAAATG A
 
Protein sequence
MSGVRIRRLA SAARPQGMDY VVLMPGANLQ YFTGLTLHLS ERLALALIAA DGQSINIVLP 
ALEQPRALAE YSGEVAVRWF PWSDDEGPMN ALRNAAAGLI GRTVGVEYTT MRVLELRALE
EVAGVHSIDA SAAIASLRMQ KGADEIALMR EAVRIVEAGL KTAIEALHPG RTEREIARIW
EEAMQLEGGE GPSFATIVAS GPNSANPHHT TGERQIQTGD LVILDGGALY RGYCSDITRT
VCVGEPNEQQ RMLYETVLAA NRAACAGAKP GMSGAQVDRL ARQVVEDAEL GRYFIHRTGH
GLGMEIHEPP YIASTNTVAL PIGTVFTVEP GTYVAGIGGV RIEDDVLLTP TGAECLTNFP
RELIVK
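
The sequences above are printed in space-separated blocks of 10, so they need to be cleaned before any computation. The following Python sketch shows one way to strip that formatting, compute a GC fraction, and translate a gene sequence. It is an illustration under stated assumptions, not the method used to produce this record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the amino acid assignments are those of translation table 11 (which match the standard code), and alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
# Codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same amino acid assignments as the
# standard code; only the set of permitted start codons differs.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAGTGGCG TGCGAATTCG ACGTTTGGCT"))  # MSGVRIRRLA
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence is one way to confirm that the two listings agree.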