Gene Haur_0590 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0590 
Symbol 
ID5732488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp679819 
End bp681375 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content52% 
IMG OID641277717 
Productpeptidase M1 membrane alanine aminopeptidase 
Protein accessionYP_001543366 
Protein GI159897119 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0535796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGTT GGCTTTATAT TGGTTTGGCT GTGCTTTTGA TTGGCTGTGG CCAAGCCTTG 
AAAGCGACCA GCAGCGAGGT TGCTCCAAGC ACTAAGCCTG CCGCCGCGCC AACGATTGAT
TTAACTGCCC TGATCACATG GCAGCCAGTG CCAGCCTATA CCCCACAAAT TCCCACGCCA
ACTAATATTC CAATTCCCAC GCCAATTCCC GATGCTTGGC CAAATGAGCA GGCTTTGGCG
CTGCGCCCCG AATTTGCGGG CGATGCTACT CATTCAACCA TGCCACGATA TGCAATGCAG
ATTACGCTTG AGCCAAATGT AGGTCGTTAT AGCGGCAGCC AAATTATTAC CTACACCAAT
ACGACTGGCG CTAGTATCAA TAGTTTGGTG CTGCGTTCGT ATAATAATTT TCCGCCCGAT
GCCCAAGGCG ATGGCGGCGA TACCACGCTT GAAGTTACTA AGGCTTGGAG CAATGGCAAC
GAATTGAATC TTGCTCGTGA AGCCGAAAAT ACCGCTTTTC GGCTTAATCT CGCCACAAGC
CTAGCACCCA ATCAGCAGGT TGTAATCAGC ACCCAATTTG CTGGCACAAT CAAGGCTTGG
CCCGATGGCT CGTATCCCTT GCTTTCGGCC TATCCGATGC TAGCGGTTTG GGATGCAGCA
GCCAACGATT GGCGCATGGA TGTAACGCGC TTTCCTGATC GGGTGTTTGC TGAAACTGCA
CTCTATCGGG TGCAACTCAA GCTACCTGCC GAATATCAAG TCATTGGAGC GGGAACGCTG
CTGGAGCAAA ATGCTGATCG CACCAGCCGC ACCTATGTTA GTGGGCCAGT GCGGGAGTGG
GCCGCCGCGC TTGGTCAATT TTCGGTCAGT ACAAGCAGCA TTGATGGAAT TGACGTTAAT
GCCTATGGCC CCGATTATCT TGATTTGGCG CGGGTGCGCG AGATGGCAAT TGGCGCTTTG
ACTAGTTATC AAGCCAAAAT CGGCGCGTAT CCCTACCGCA AACTCGATCT GCATGTGATG
CCATGGGATA GCGGCGGCGG CATCGAGTAT CCAGCTTACA CGATTATTTT GGTGAACGAT
GGGATCAATC GTGATGCTGA TTATGTGGTG TTCCATGAAG TAGCCCATCA GTGGTGGTAT
GGGTTGTTGG GCAACGATGT CTATCGTGAG GCTTGGCTGG ATGAGGCCAT GGCTAGCTAC
TTGACCTATG TGGCAACTCA GGACGTGCTT GGTCAAGCTG CCGCCGATAG CTATTACAGC
GGCGAAATCG AGCGTTTGGC GCTCAGTAAT CAAGCCAATG GCAATTGGCC AGCAGGCTTG
GCGATCAATC AATATCCATC ATTTAATAGC TACTATCGCG CAGTCTATGG CAAAGGCGCG
GCGATGTTGC ATCAGTTACG CATCAAGCTT GGCGATCAAA GCTTCTTCCA AGGTTTAGCC
CAACTCAACG ACCAAAAGCG CTATCAGATT ATCACCCGTA GCGATTTCCA AAGCGTTATG
GAGCAAAGCA GCGGCCAAGC CCTAGGCGAG TGGCTGGATG GCTGGCTCAA CTGGTGA
 
Protein sequence
MRRWLYIGLA VLLIGCGQAL KATSSEVAPS TKPAAAPTID LTALITWQPV PAYTPQIPTP 
TNIPIPTPIP DAWPNEQALA LRPEFAGDAT HSTMPRYAMQ ITLEPNVGRY SGSQIITYTN
TTGASINSLV LRSYNNFPPD AQGDGGDTTL EVTKAWSNGN ELNLAREAEN TAFRLNLATS
LAPNQQVVIS TQFAGTIKAW PDGSYPLLSA YPMLAVWDAA ANDWRMDVTR FPDRVFAETA
LYRVQLKLPA EYQVIGAGTL LEQNADRTSR TYVSGPVREW AAALGQFSVS TSSIDGIDVN
AYGPDYLDLA RVREMAIGAL TSYQAKIGAY PYRKLDLHVM PWDSGGGIEY PAYTIILVND
GINRDADYVV FHEVAHQWWY GLLGNDVYRE AWLDEAMASY LTYVATQDVL GQAAADSYYS
GEIERLALSN QANGNWPAGL AINQYPSFNS YYRAVYGKGA AMLHQLRIKL GDQSFFQGLA
QLNDQKRYQI ITRSDFQSVM EQSSGQALGE WLDGWLNW