Gene Haur_0115 details


Gene Information

Locus tag: Haur_0115
Symbol:
ID: 5732008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 147983
End bp: 149488
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 52%
IMG OID: 641277237
Product: carboxypeptidase Taq
Protein accession: YP_001542895
Protein GI: 159896648
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2317] Zn-dependent carboxypeptidase
TIGRFAM ID:
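
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes (the record does not state this) that coordinates are 1-based and inclusive, and that the gene length includes the terminal stop codon while the protein length does not. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 147983
end_bp = 149488
gene_length_bp = 1506
protein_length_aa = 501

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon,
# which does not contribute a residue to the protein.
codons = span // 3
residues = codons - 1

print(span, codons, residues)  # prints: 1506 502 501
```

Under these assumptions the span matches the listed gene length, and 502 codons minus one stop codon gives the listed 501 residues.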


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0714794
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGAAC AATTAGCTGA TTTGAAGGAA AAAATTGGGG TTGCTCAAGA TTTATCGTAT 
GCTAGCGCTG TGTTAAATTG GGATCAAAGC ACCTACATGC CCGCTGGCGG GGCCGAAGCT
CGTGGTCGCC AAATGGCCAC GCTCAGCCGC TTAGCCCACG AGCATTCCAC CGCGCCCGAA
GTTGGTCGCT TGCTTGAACA GCTCGTGCCG TGGGCCGAAC AACAAGATCC TGAATCGGAC
GACGCAGCCT TGGTCTTGGT GACCAAGCGC GATTATGACC AAGCGGTGCG CGTGCCCAGC
GAGTTTATTG CTGAATTGTA TTCGCACGTC GCCAAAACCT ATACCGCCTG GACCCAAGCT
CGCCCCAACA ACGACTTCGC TTCAGTCGCC CCATTACTCG AAAAAACCCT TGACCTCAGC
CGCCGCTATG CTGAATTTTT CGGCCCCAGC GAACATATTG CCGATCCACT GATCGATATG
GCTGATCAAG GCATGACGGT GGCAAAAATT CGTGAGATTT TTGGACCCTT GCGCGAACAA
CTCGTGCCAT TGGTCAAAAC CATCACCGAG CAACCTGCCG CTGACGATAG CTGTTTGTTG
CAACACTACC CCGAAGCTGA GCAATTGGCC TTTGGTGAAA GCCTCATTCG CGAAATTGGC
TACGACTTCG AGCGCGGTCG CCAAGATAAA ACCCACCATC CCTTCATGAC CAAGTTCTCG
ATTGGTGATG TGCGGATCAC CACCCGCTTC CGTGAAAACG ATTTGAGCGA TGGCCTGTTT
AGCACAATTC ACGAAACTGG CCATGCCGTC TATGAGTTGG GCGTAAATCC AGCCTACGAA
AATACCCCAT TGGCAAGCGG AGCTTCGGCA GGAACCCACG AATCACAATC ACGCTTGTGG
GAAAATGTGG TTGGTCGCAG CCGCGCCTTC TGGCAATATG CCTATCCCAA AGCTCAGGCC
GCTTTCCCCA ATCAACTGGG CAAGGTCGAT TTAGATACCT TCTATCGGGC GATTAACAAA
GTGCAGCGCT CGTTGATTCG CACCGATTCA GACGAAGTGA CCTACAACTT GCACGTTATG
ATTCGCTTCG ATTTGGAATT GGCCTTGCTC GAAGGTAAAT TGGCAATTCG CGATTTGCCT
GAAGCTTGGC ACGAACGCTA TCGCAGCGAT TTGGGCATTA CCGCACCCGA TAATCGCGAT
GGTGTGCTGC AAGATGTCCA CTGGTATGGT GGGATTATCG GTGGCTCGTT CCAAGGCTAC
ACGTTGGGCA ATATCCTCAG CGCTCAAGTT TTTGATGCTG CTGTGCGAGC CAATCCCAAT
ATACCAACTG AAATTAGCCA AGGCCAATTT GCCAACCTGC ACAATTGGCT GAAATCGAAT
ATGTACGTTC ATGGCCGCAA ATATAGCGTA CCAACCTTGA TCCGCAAGGT TACAGGCCAA
GACCTCAGCA TTGAGCCGTA TATTCGCTAT CTCCGCACCA AATACGGCGA ACTCTATTCG
TTGTAA
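
A GC content figure such as the one in the record can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the helper name `gc_fraction` is hypothetical, and the record does not say how its 52% value was rounded.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks used in the
    listing above) is removed before counting.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


# Usage: the first line of the gene sequence from the listing above.
first_line = "ATGCAAGAAC AATTAGCTGA TTTGAAGGAA AAAATTGGGG TTGCTCAAGA TTTATCGTAT"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1506 bp sequence instead of a single line gives the value to compare against the listed GC content.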
 
Protein sequence
MQEQLADLKE KIGVAQDLSY ASAVLNWDQS TYMPAGGAEA RGRQMATLSR LAHEHSTAPE 
VGRLLEQLVP WAEQQDPESD DAALVLVTKR DYDQAVRVPS EFIAELYSHV AKTYTAWTQA
RPNNDFASVA PLLEKTLDLS RRYAEFFGPS EHIADPLIDM ADQGMTVAKI REIFGPLREQ
LVPLVKTITE QPAADDSCLL QHYPEAEQLA FGESLIREIG YDFERGRQDK THHPFMTKFS
IGDVRITTRF RENDLSDGLF STIHETGHAV YELGVNPAYE NTPLASGASA GTHESQSRLW
ENVVGRSRAF WQYAYPKAQA AFPNQLGKVD LDTFYRAINK VQRSLIRTDS DEVTYNLHVM
IRFDLELALL EGKLAIRDLP EAWHERYRSD LGITAPDNRD GVLQDVHWYG GIIGGSFQGY
TLGNILSAQV FDAAVRANPN IPTEISQGQF ANLHNWLKSN MYVHGRKYSV PTLIRKVTGQ
DLSIEPYIRY LRTKYGELYS L
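
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is illustrative rather than the database's own method: it builds the codon table from the standard 64-character amino acid string, which translation table 11 shares with the standard code for internal codons (table 11 differs in its set of permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acid string
# used by the standard code and by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Usage: the first 30 bases and the final 6 bases of the gene sequence.
print(translate("ATGCAAGAAC AATTAGCTGA TTTGAAGGAA"))  # prints: MQEQLADLKE
print(translate("TTGTAA"))                            # prints: L
```

The two outputs match the first ten residues and the final residue of the protein sequence listed above.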