Gene Information |
Locus tag | Haur_2664 |
Symbol | |
ID | 5734559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3418202 |
End bp | 3419299 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279806 |
Product | peptidase M14 carboxypeptidase A |
Protein accession | YP_001545430 |
Protein GI | 159899183 |
COG category | |
COG ID | |
TIGRFAM ID | |
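
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end positions are 1-based and inclusive, which matches the listed gene length, and that the final codon of this unspliced CDS is a stop codon. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive interval length
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1   # drop the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(3418202, 3419299)
print(gene_len, protein_len)  # 1098 365
```

The result agrees with the Gene Length (1098 bp) and Protein Length (365 aa) fields.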
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00317155 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT ATCTGCTTAT GCTAATAATG GTCGCTGGAT TGCTTGGCAT TCCAGCGACC ACGCTTGTTG CTCAAACCCC AACCATTGCG CCAACGCCTA GCCCGAGCAC TGTTCCAGTT GTCGATTTGA CCTTTGGAAC TTCGTTGCAG GGGCGGGCGA TTACGGGTAC GCGCTTTGGC AATGGCCCAC GCAAGCTGTT TGTCGTCTCG AATACCCACG GCGGCCCCGA AGCCAATACC TATCAACTAG CCTTGGCGTT GCTTGAGCAT TTTCGCCAAA ACCCGCAACA AGTGCCTGCT GATGTGAGCC TCTACATTAT TCCGACGGTC AATCCTGATG GCTTGGCGAT TGGCACGCGT TTCAATAGCC GCAGCGTTGA TCTCAACCGC AATATGGATA CCAATTTCGA TGCCTGCCCC GAAAACGACT GGAATCAAAC CGTCGAGGGA GCGTATGGCA TCGTTTCCGA TACTGGCGGG GCATTCGTCG AATCGGAGTT GGAAAGCCAA TTAGTACGTG ATTTGGTGCT CGATGCTTCG GCGGTAGTTT GGGTGCATAG CGATGGTGGC GATGTGTTTC CAGCTTTTTG TGAACATGGG CCATCAATCG CCTTAGCTCA AGCCTATGCC GAAGCTAGCG GCTATCGCTA CGATCGTTAT TGGCGTAACT ATATGATCAC TGGCGGCATG CACGATTGGG CGGGTGCACT TGGCATTGCC TCGATTACCC CAGAATTATC AACGGGCGAA TTGCCTGATT TCGACGAGAA TTTGCGGGCA GTGCAAGCAG TCATGGGGCG TTCTGAGGAG CTATTGCCGT TGGTAACGAC GACAGTGGAA GTAACCACCA ACATCACCAT GCCCACCTTA TTATGGCGCT ATTGGCGTGC CCACGGTGGC CCTGATTTCT TTGGCCTACC GATTTCGCCT GTGGTTGAGT ATCAAGGGCG GCCAACCGTC TTTTTTGCCG AGCAATATTT GAGTTTAGTG CCCGATGGGG CTGATCGCAT GCAGCCTGCG CTACGTGGCG AAGGTGGTTT GGCGCTTTGG CAAGCCTATT TGCTCGATCA ACAATCAACC AGCTATCAAG TGCGCTAA
Protein sequence | MKKYLLMLIM VAGLLGIPAT TLVAQTPTIA PTPSPSTVPV VDLTFGTSLQ GRAITGTRFG NGPRKLFVVS NTHGGPEANT YQLALALLEH FRQNPQQVPA DVSLYIIPTV NPDGLAIGTR FNSRSVDLNR NMDTNFDACP ENDWNQTVEG AYGIVSDTGG AFVESELESQ LVRDLVLDAS AVVWVHSDGG DVFPAFCEHG PSIALAQAYA EASGYRYDRY WRNYMITGGM HDWAGALGIA SITPELSTGE LPDFDENLRA VQAVMGRSEE LLPLVTTTVE VTTNITMPTL LWRYWRAHGG PDFFGLPISP VVEYQGRPTV FFAEQYLSLV PDGADRMQPA LRGEGGLALW QAYLLDQQST SYQVR
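
The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a block-formatted string could be joined and its GC content computed follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first two blocks of the gene sequence, so its result is not the GC content of the whole gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAAAAAAT ATCTGCTTAT")
print(len(fragment), gc_percent(fragment))
```

Passing the full Gene sequence value through the same two functions gives a figure that can be compared with the GC content field.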