Gene Haur_2620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2620 
Symbol 
ID5734498 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3361935 
End bp3363203 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content49% 
IMG OID641279760 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001545386 
Protein GI159899139 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0136175 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCCTG TAAAAGTTGT CTTACCAAAT GGCCTGCGAA TTTATACCGA TGAAATGCCC 
CATACCCATT CAGTTTCGAT GGGTATTTTT ACCCAAGTTG GCTCGCGCTA TGAAAATGCT
CGCCTGACGG GAATTTCACA TTTTTTGGAG CATATGTTTT TTAAGGGTAC TGCCAAATAC
CCCACTGCCA AAGACCTTAG CGAGGCAATT GAGGGCATTG GTGGCTATAT CAACGCTACT
ACCTCGTATG ATACAACCTG TTATTATTGT AAAGTTGCCA ATATTCATAC CGAACGCGGC
ATCGATGTGT TAACTGATAT GCTCAACGCT GCCCTATTCG ACCCTAAAGA AATTGAAAAA
GAACGCGGCG TGATTCAAGA AGAAATTAAA ATGTCGCTCG ATGTACCCGC TCAATGGGTG
CATCAATTGC TCGACGAATT AATGTGGGGC GATCAGCCAC TTGGCCGTGA TATCGCTGGC
ACGCTCGAAA GTGTTGGAGC CTTTAGCCGC GAAGATTTGT TGAATTACCG CGATCAGCAT
TATGTTGCAG GTAATACGGT CATTTCGTTG GCTGGCAACT TTAATAGCAC CGAAATTGTT
GATCGTCTGA CGAGCTTATT TAGCCATTAT CGGGTGCTTG ACGTGCCCAA ACCAATTACC
ACCAATAGTT TTGGCACAGC TCCAGTTGTG CATCTTTTAA ATAAACCAAC CGAACAAACC
AATTTTGTGT TGGGCCTCAA ATCGTTTGGC TATGGCGATA GCGATCGCTG GGCGCTCAGC
GTGCTCGATA GCATCCTTGG TGGCGGTATG TCTTCGCGCT TGTTCCAAGA AATTCGCGAA
GAACGCGGCT TGGCCTATAG CGTCGGCTCC TACACCGCCG AATACGATGA CGCTGGCAAA
TGGATTGTGT ATGGCGGGGT TGAAGTCAGC AAGGCAGTCG ATGCAATTGC CGCAATTATC
GAAGAACTGC GCAAATTGCG CGATCATGGG GTGACTGCCG CCGAGTTACA CCGCATCAAG
GAGCAAGTTA AGGGCGGAAT GCTGCTTGGG CTGGAAGATA CTTGGTCGGT GGCCAATCGC
AATGCTCGCC ACGAACTGCG CTACGGCGAG GTGATTCCGG TTGAGCAAAT TGTGGCTTGG
ATCGAAGCGG TCACGCTCGA AGATATTCAG CGCGTGGCTC AACGCCTAAT TCGCCCAGAT
AACTTATACT TAGCAATCAT CGGCCCGCAT GCCGAGGCTG CTGAATTTGA ACAAGCTATC
ACGTTATAG
 
Protein sequence
MAPVKVVLPN GLRIYTDEMP HTHSVSMGIF TQVGSRYENA RLTGISHFLE HMFFKGTAKY 
PTAKDLSEAI EGIGGYINAT TSYDTTCYYC KVANIHTERG IDVLTDMLNA ALFDPKEIEK
ERGVIQEEIK MSLDVPAQWV HQLLDELMWG DQPLGRDIAG TLESVGAFSR EDLLNYRDQH
YVAGNTVISL AGNFNSTEIV DRLTSLFSHY RVLDVPKPIT TNSFGTAPVV HLLNKPTEQT
NFVLGLKSFG YGDSDRWALS VLDSILGGGM SSRLFQEIRE ERGLAYSVGS YTAEYDDAGK
WIVYGGVEVS KAVDAIAAII EELRKLRDHG VTAAELHRIK EQVKGGMLLG LEDTWSVANR
NARHELRYGE VIPVEQIVAW IEAVTLEDIQ RVAQRLIRPD NLYLAIIGPH AEAAEFEQAI
TL