Gene Information |
Locus tag | Haur_3105 |
Symbol | |
ID | 5734977 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3917677 |
End bp | 3919575 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280249 |
Product | peptidase M14 carboxypeptidase A |
Protein accession | YP_001545871 |
Protein GI | 159899624 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
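The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the terminal stop codon. Both assumptions are consistent with the listed numbers but are not stated in the record itself. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3917677
end_bp = 3919575

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein length.
protein_length = codon_count - 1

print("gene length (bp):", gene_length)
print("leftover bases after whole codons:", remainder)
print("protein length (aa):", protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1899 bp) and Protein Length (632 aa) fields.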
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000214938 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCCGT CACGCATCGT TCGGTTGGTT GGTTCACTCG CATTAGCCGC AGGGCTGATG GCTCCGTTGA GTGCTTTGGG GCAAACACGG CAGCCAGTTC AGCAAACGGA GCCGCTTGAT CAGGCGCGGG CCTATCATCT TGAAGGCGTA ACCACGCGCG AAGATCGCAA TGCAATTGCC GCAACTGGTG CTTCAATTGA TGCAGTTCAT GGCAAGGTGT TGGATATTAC CGCCAATGCC GAAGAAGCTG CGGCGATTGA GCGCTTAGGC TTTAAATTGG TCGAGCTACC TGAACTGACC GATTTTCCAG GCGCAGATTC GGCCTACCAT AATTATGCTG AGATGACCAG CAATATTGCG GCAGTTGTTG CCAGCAAGCC GAGCATTGTG AGCCGCTTTA GCATTGGCCG CTCGTATGAA AATCGCGATT TGATTGCGGT TAAAATTAGC GATAATGTCG CAACCGATGA GAACGAGCCA GAAGCCTTGT TCATCGGCCA GCACCATGCC CGCGAACACC TGACCGTCGA AATGACCCTG TATCTGTTAC ATTTGCTGGT CGATAACTAT GGCATTGACA ATCGGATTAC CAACATTGTC AATAGCCGCG AAATCTACAT CGTTTTCAGC TTGAACCCTG ATGGCAGCGA ATACGACGTA GCAACTGGCA GCTATCGCAG CTGGCGCAAA AATCGCCAAC CCAACAGTGG CTCTTCCTAC GTTGGCATCG ACCTTAACCG CAACTATAGC TACAAATGGG GCTGCTGTGG TGGCTCAAGT GGCTCAACCT CGAGCGATAC CTATCGGGGC ACGGCAGCCT TTACCGCTCC CGAAACCCAA GCGATTCGTA ATTTCGTCGC TAGTCGGGTA GTTGGCGGTA AACAACAAAT CAAAACCTCG ATTTCATTCC ATACCTATAG CGAACTGGTG TTATGGCCAT ATGGCTACAC CTACGATGCC TATCCGAGCG ATATGGTACG CGACGATTAT GATGCAATGG CCGCGCTTGG CCGCACTATG GCATCGAGCA ATGGCTATAC ACCACAACAA TCCAGCGATT TGTATGTTGC TGATGGCACC TATGAAGATT GGGCGTATGG GGTGCATCGA ATCTTTGCCT ATACCTTTGA AATGTATCCT CGCTCTTCGA GTCCAGGTTT CTACCCACCT GACGAAGTGA TTAGCCGTGA AACCACCCGT AACCGTGAAT CGGTCTTGTA TTTGTTGGAA CAAACCGATT GTCCCTATCG CGTGATTGGC AAAGAAGCTC AATATTGTAG CGGCGGTGGC ACGCCAACCC CAACCGCGAC ACCTGGGCCA ACCGCAACCC CAGGCCCAAC CGCTACACCA AACCCAGTCG TCACCGTATT TAGCGACGAT TTTGAAGCCA ATCAAGGCTG GACAACCAAT CCCAATGCGA CTGATAGCGC AACCACCGGC GCATGGGAAC GGGGCGACCC TGAAGCAACC GATAGCAGCG GAGCCAAGCA GCTTGGCACA ACTGTCAGCG GCAGCAACGA CCTTGTAACG GGCCGTTTGG CTGGCAGTTC AGCTGGAGCC TACGACCTTG ATGGTGGCTC ATCGTCAGTC CGTTCGCCAG CCTTCACCTT GCCAAGTTCT GGTAATTTGA GCTTGAGTTT CAGCTACTAC TTGGCTCATG GCTCGAATGC CAGCAGCGCC GATTACTTCC GCGTGTCGCT CGTCACCAGC TCTGGCACGG TCAAAGTCTT TGAAAAATTG GGCAGCGCAA CCGATGTTGA TGCCGCCTGG ACAGCCGCTA CTGTTAGCTT AAACAGCTAC 
GCTGGTCAAT CGGTGCGAAT CTTGATTGAA GCTTCCGATG CCAGCACTGC TAGCTTGGTA GAAGCTGCTG TGGATAATGT TAGCGTGACC CAACGCTAA
|
Protein sequence | MKPSRIVRLV GSLALAAGLM APLSALGQTR QPVQQTEPLD QARAYHLEGV TTREDRNAIA ATGASIDAVH GKVLDITANA EEAAAIERLG FKLVELPELT DFPGADSAYH NYAEMTSNIA AVVASKPSIV SRFSIGRSYE NRDLIAVKIS DNVATDENEP EALFIGQHHA REHLTVEMTL YLLHLLVDNY GIDNRITNIV NSREIYIVFS LNPDGSEYDV ATGSYRSWRK NRQPNSGSSY VGIDLNRNYS YKWGCCGGSS GSTSSDTYRG TAAFTAPETQ AIRNFVASRV VGGKQQIKTS ISFHTYSELV LWPYGYTYDA YPSDMVRDDY DAMAALGRTM ASSNGYTPQQ SSDLYVADGT YEDWAYGVHR IFAYTFEMYP RSSSPGFYPP DEVISRETTR NRESVLYLLE QTDCPYRVIG KEAQYCSGGG TPTPTATPGP TATPGPTATP NPVVTVFSDD FEANQGWTTN PNATDSATTG AWERGDPEAT DSSGAKQLGT TVSGSNDLVT GRLAGSSAGA YDLDGGSSSV RSPAFTLPSS GNLSLSFSYY LAHGSNASSA DYFRVSLVTS SGTVKVFEKL GSATDVDAAW TAATVSLNSY AGQSVRILIE ASDASTASLV EAAVDNVSVT QR
|
| |
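The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, using only the first three blocks of the gene sequence as input; the helper names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tool. The GC fraction of such a short excerpt is not expected to equal the whole-gene GC content field.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace between sequence blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(base in "GC" for base in seq)
    return gc_count / len(seq)


# First three blocks of the Gene sequence field, copied from the record.
excerpt = "ATGAAGCCGT CACGCATCGT TCGGTTGGTT"

seq = join_blocks(excerpt)
print("length:", len(seq))
print("starts with ATG:", seq.startswith("ATG"))
print("GC fraction of excerpt:", gc_fraction(seq))
```

Applying the same two helpers to the full Gene sequence field would allow a comparison against the Gene Length and GC content fields.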