Gene Haur_4249 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4249 
Symbol 
ID5736103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5421810 
End bp5423483 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content51% 
IMG OID641281404 
Productpeptidase M14 carboxypeptidase A 
Protein accessionYP_001547009 
Protein GI159900762 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2866] Predicted carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGAGA TTGATTACAC ACGCTATTAT CGTTTCGCTG AGTTGGTGGA AGCGCTAGAA 
GGCTTTGCCG CCGAATATCC CGATTTGATT AGTTTGCAAT CGATCGGTAA AAGTTATGAA
GGCCGTGATT TGTGGTTAGC GACCGTTACT AATGTTGCAA CTGGTGGGCC ACGCGAAAAG
CCAGCCTTTT GGGTTGATGC CAATATCCAT GCGAGCGAAG TAACTGGCGC AATGGCTGGC
TTACACTTGA TCGATACGCT GCTCAAAGGC TATGGCAACG ATGCTGAATG CACGCGGTTG
CTTGATCGCA CGACCTTCTA CATTTTGCCA CGCTTCAATC CTGATGGAGC TGAACGGGCC
TTGACCACGC CCTATGTAGT GCGGTCGAGC GTGCGACCTT ATCCCTATGC TGAACGCATC
GATGGCTTGT ATCAAGAAGA TATCAACGGC GATGGGATTA TTTTGCAGAT GCGCTTGGTT
GATCCCAACG GCGATTGGCG GGTCTCCGAG CATGATCCAC GGGTGATGGT CAAGCGCAAG
CCCTATGAAA TTGGCGGCAC CTACTATCGA ATTTTGCCCG AAGGCTTGAT TCAAAATTAC
GATGGGGTCA ATATCAAACT GAGCCGCGCA GTCGAAGGCT TGGATATCAA CCGCAACTTT
CCAGTTGATT GGCGACCTGA AGCCGAGCAA TATGGTGCTG GCCCCTACCC AACCTCTGAG
CCAGAAATCC GCGCTGTGGT GCAATTTATC GTCGATCACC CCGAAATTCA TAGTGGCCTG
ACCTACCACA CTTATTCGGG CGTGCTGCTG CGACCATATG GCGACCGCGC CGATGATCAG
ATGAATTTGC ATGATCTCGA TGTGTTTAAG GCGTTGGGTA AACATGGAAC CGAGTTAACC
GGTTGGCCCA GTGTTTCGGT TTACCACGAT TTTCGTTACC ACCCCAAAGA TGTGATTACC
GGGGTGTTTG ATGATTGGGT CTACGATCAC TTGGGTATGT TTGCCTGGAC AGTCGAATTT
TGGGATTTAG TTGGTTCGGC AGGGATCAAA GATCGCAAAT TTATCGAGTG GTTCAAAGAG
CACCCCGAAG AAGATGATCT TAAAATTATG CAATGGGTGG ATGAGCATGG CGAGGGCTTG
TGCTTCTACG ATTGGACAGC CTTCGAACAT CCCCAGCTTG GCCCAGTTGA AATTGGTGGC
TGGCATCCGA TGTATGCCTT CCGTAACCCA CCGCCAGCCA AATTGCTTGA AACGATTGCG
CCTGTGACCC AATTTGCCTT AGCTCATGCC GCGATTGCCC CATTCACCAC AATTAGCAGC
TTTGAGCTTG AGGCGTTGGG CGATAACGTT TATCGGCTGC AAGCAGTGGT GCAAAATGAA
GGTTATTTAC CAAGTTATGG CTCGCAAAAA GGCCGCGAAC GCAAGGCGAC CTTGCCACTT
GAAGCCTTGC TCAATCTACC CGAAGGTTCA AGCCTCAAGC TTGGCCAAGC CAAAACCACG
ATTGGCGATT TGGAAGGGCG TTCAGGCCGA GTTTCATTCT TTGGCTTTAG CAATGGTTCG
ACCACTGATC GCACCAAAGT CGAGTGGGTG GTGCATGTGC CAAATCCTGG GGTGATTGAA
TTGACGATCC AAGGTGGACG CGGTGGCATT GCCCGCGCTA AGCTCGAAAT CTAA
 
Protein sequence
MPEIDYTRYY RFAELVEALE GFAAEYPDLI SLQSIGKSYE GRDLWLATVT NVATGGPREK 
PAFWVDANIH ASEVTGAMAG LHLIDTLLKG YGNDAECTRL LDRTTFYILP RFNPDGAERA
LTTPYVVRSS VRPYPYAERI DGLYQEDING DGIILQMRLV DPNGDWRVSE HDPRVMVKRK
PYEIGGTYYR ILPEGLIQNY DGVNIKLSRA VEGLDINRNF PVDWRPEAEQ YGAGPYPTSE
PEIRAVVQFI VDHPEIHSGL TYHTYSGVLL RPYGDRADDQ MNLHDLDVFK ALGKHGTELT
GWPSVSVYHD FRYHPKDVIT GVFDDWVYDH LGMFAWTVEF WDLVGSAGIK DRKFIEWFKE
HPEEDDLKIM QWVDEHGEGL CFYDWTAFEH PQLGPVEIGG WHPMYAFRNP PPAKLLETIA
PVTQFALAHA AIAPFTTISS FELEALGDNV YRLQAVVQNE GYLPSYGSQK GRERKATLPL
EALLNLPEGS SLKLGQAKTT IGDLEGRSGR VSFFGFSNGS TTDRTKVEWV VHVPNPGVIE
LTIQGGRGGI ARAKLEI