Gene Haur_2642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2642 
Symbol 
ID5734522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3389943 
End bp3391043 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content45% 
IMG OID641279784 
Productpeptidase M29 aminopeptidase II 
Protein accessionYP_001545408 
Protein GI159899161 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTTGATC AACGCTGGCA ACAGTTGGCC AAAATCATTG TCCATCACTC ATTAGAGCTA 
CAACCAAACG ATTTGCTACG AATTCAAGCT GAAGCTATTG CGAAGCCATT GCTCTATGCG
CTCTACCGCG AGGCACTCCA TGCTGGAGCC TTAGTTATTC CCAAAATTGT CGATCCGGTA
TTTGAAGAAA TTATGCTCAA AGAGGGCACT CCTGAGCAGC TGCAATTTGT GCCAAGCACC
TTAGTTCACG AAATTGAAAC CATGACTACT TGGTGTGATA TTTATAGCGA AATAAATACG
AAACATTTCA ACCAAGCCGA TCAACAACGC CAACTATTGC GCCGAAAAGC ATTCGGCCCA
GTCCAAGTAT TATTCGATAG CAGAGCAGCT CAAAATCAAT TGCGCTGGTG CGATGTGCTT
TATCCAACCG AGGCTTTTGC TCAAGACGCA GGTATGTCGC TGTGGGATTT TGAAGACTTG
GTAGTAAAAT CCTATCTGCT TGATCATCCA GATCCAGTTA CAGCGTGGCA GACCATCCAT
CAACAACAGC AAAAAGTTAC CCACTTCCTC AATAGTTGTC GCTCAATTCG GATTGAAGGG
CCAGACGTTG ATTTGAGTTA TCGCTGTGAA GATCGCATTT GGATTAATTG TGCTGGTAAA
CGCAATCTGC CCGATGGCGA AGTCTTTACC GCGCCAATCG AAGATTCAGT CAATGGCCGA
TTGAAGATTA GCTATCCAAG CATTTATCAG GGGAATTTGG TTAGCGGAAT TCAGCTCGTG
ATTGAAGATG GCAAAGTAAC CCAAGCAACT GCTGAGCAAG GCCAAGATTT TCTGCATACC
ATGCTTGATC TTGATGCTGG TGCTCGGTAT ATTGGCGAGG TTGCCTTCGG CCTGAACCCA
GGCATTACAA AACCAACTGG TCATACTATT TTCGATGAAA AGATGGCTGG AACGATGCAC
TTAGCACTTG GTCGAGCCTA TCCTGAGTGT GGCGGCAAAA ATGAATCAAC CCTGCACTGG
GATTTAGTCT GCGATTTACA TCAAGCTGAA GTGTATGCCA ATAATGCGCT GTGCTACAAA
AATGGTGAGT TTATTATTTA A
 
Protein sequence
MFDQRWQQLA KIIVHHSLEL QPNDLLRIQA EAIAKPLLYA LYREALHAGA LVIPKIVDPV 
FEEIMLKEGT PEQLQFVPST LVHEIETMTT WCDIYSEINT KHFNQADQQR QLLRRKAFGP
VQVLFDSRAA QNQLRWCDVL YPTEAFAQDA GMSLWDFEDL VVKSYLLDHP DPVTAWQTIH
QQQQKVTHFL NSCRSIRIEG PDVDLSYRCE DRIWINCAGK RNLPDGEVFT APIEDSVNGR
LKISYPSIYQ GNLVSGIQLV IEDGKVTQAT AEQGQDFLHT MLDLDAGARY IGEVAFGLNP
GITKPTGHTI FDEKMAGTMH LALGRAYPEC GGKNESTLHW DLVCDLHQAE VYANNALCYK
NGEFII