Gene Haur_4725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4725 
Symbol 
ID5736569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6036071 
End bp6037156 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content54% 
IMG OID641281890 
Productpeptidase M24 
Protein accessionYP_001547484 
Protein GI159901237 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.28239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTCAAG AACGGCTCGC AGCGTTGCGG ATGCTTTTTG AAGCAGCCCA AATCGATGGA 
TTGTTGGTCG CTAATAGCCA AAATCGGCGC TATCTGAGCG GGTTTACTGG CTCGGCGGGC
TTGTTGATCA TTGATGCTCA ACGAGCATTA TTAATCAGCG ATGGCCGCTA TACCGTGCAA
GCCGCCCAAG AGGCCAGCCA ATTTGAAACG ATCACCCGCA CGCTTGATGA AAGCTTGTAT
AGCTGTGTTG GCCGCCATAT TGCGCCGATC AAACGCTTGG GCTTCGAGCC AGCAACCCTC
AGCGTTGCCG ATTACAATGC CTTGCGCCAA GCCTTGCCTG CTGATGTAAC CTTGGTTGCC
ATCGGGGCAT TGACCGAGCA ACTCCGCGCG ATCAAAAGCG ACGAAGAAGT TGCGGCCTTG
CGTCAAGCAA TTAACATCAC CGACCAAGCC TTAGCGGCAG TCAAGCCAAT GTTGCGCCCA
AGCATGCTCG AACGCGAAGT CGCTTGGGAA TTGCACAAGG CAATTGTTGA GCATGGCGGC
GATGGTTTAG CTTTTGAAAT TATCGTGGGT GCTGGCTTAA ATAGTGCTTT GCCCCATTAT
CACGCTGGTA ACGCCCCGCT GGGCCAAGGC CAGCCGATTG TGGTCGATTT TGGGGCGCTC
TATGCTGGCT ATCATGGCGA TATGACCCGC ACCTTGGTGC TCGGCCAGCC CGATGCCAAA
TTTGATGAAA TTTATGGCAT TGTGCGCCAC GCGCTTGCGG ATGCAACCAA CGGCATCACC
GCCAATACCA CTGGCAAAGA AGCCGATGCC TTGGCTCGCG ATGTGATCGA AGCCTCAGGC
TATGGCGAAT ATTTTAGCCA TGGCACAGGC CACGGGGTTG GCCTGCAAAT TCATGAAGAG
CCACGGCTCA GCCGCGTTCA CAACGATTTG CTGCCAGTTG GCTCAATTTT TAGCATCGAG
CCTGGCATTT ATTTGCCCGA TTGGGGCGGC GTGCGGCTCG AAAACTTGGT TTTACTCAAT
GCCAATGGTG TTGAAACGCT TACACAATCG CCACTTGACC CGATCATTGT GATCGAGCAA
GCCTAA
 
Protein sequence
MSQERLAALR MLFEAAQIDG LLVANSQNRR YLSGFTGSAG LLIIDAQRAL LISDGRYTVQ 
AAQEASQFET ITRTLDESLY SCVGRHIAPI KRLGFEPATL SVADYNALRQ ALPADVTLVA
IGALTEQLRA IKSDEEVAAL RQAINITDQA LAAVKPMLRP SMLEREVAWE LHKAIVEHGG
DGLAFEIIVG AGLNSALPHY HAGNAPLGQG QPIVVDFGAL YAGYHGDMTR TLVLGQPDAK
FDEIYGIVRH ALADATNGIT ANTTGKEADA LARDVIEASG YGEYFSHGTG HGVGLQIHEE
PRLSRVHNDL LPVGSIFSIE PGIYLPDWGG VRLENLVLLN ANGVETLTQS PLDPIIVIEQ
A