Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_3304 |
Symbol | |
ID | 6976744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 3611033 |
End bp | 3612667 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643392815 |
Product | peptidase S10 serine carboxypeptidase |
Protein accession | YP_002277646 |
Protein GI | 209545417 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2939] Carboxypeptidase C (cathepsin A) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.169507 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0135899 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGACAC GCCTGTCGCT TTTGCTATCG GCCTCGCTGC TATGCCTGGG CCAGATGCCC GCCCGGGCCG ATGACCGCGC CGATGCGCCC CATGCCGTCC ACGCCCCCGC CCATGCCGCG CCGGCGGTGG AGCGGACCAG CAGCGGAACC GTGACGATCG GCGGCAAGAC CATCGCCTAT CAGGCCGTGG CGGGCACCCT GCTGGTACAT CCGCAGAAAT GGGACGATTC GGACGATATC GTCCATCAGG AGACCCCGGG CAGCGACGTC CACACCACCG ACGGCAACCC CGACGCCACG GCGTCGATGT TCTATGTCGC CTATTTCCGC AAGGATGCCC GGCCCGAATC CCGGCCCATC ACCTTCGTCT TCAACGGCGG GCCGGGATCG TCCTCGGTGT GGCTGCACAT GGGCGCGTTC GGTCCGCGCC GGGTGGTGAC CGCCGACGAT ACCCATACCC CCGCCAGCCC CTACCGGCTG ATCGACAATC CGCAGAGCCT GCTGGACGCG ACCGACCTGG TCTTCATCGA CGCGCCCGGC ACCGGCTTCG GCCGGATCAC CGGCAAGGAC AAGGAAAAGC AGTTCTGGGG CGTGGATGAG GACGCACAGG CCTTCACCAG CTTCGTCACC CAGTTCCTGG CGAAATTCGG CCGCTACAAT TCGCCCAAAT ACCTGTTCGG CGAAAGCTAC GGCACCATGC GGTCCGCCGT GTTGATCAAC GATCTGCAGA ACCAGGAAAA CATCGATTTC AACGGCGTGA TCCTGCTGTC GCAGATCCTC AGCTACGACA ACAGCGTCGA CCAGCCGACG CTGAATCCCG GCGAGGACGA ACCCTTCGTC CTCGCCCTGC CGACCTATGC CGCCACGGCG TGGTACCATC ACAAGCTGCC GCAGCAACCC GCCGACCTGA AGGCCTTCCT GGCCGAGGTC GAGCATTTCG CCAGCACGGA CTATCTGGTG GCCCTGCAGC AGGGCGCGGC GCTGCCGCAG GCCGAGCGCG CGCGCATCAT CGACCGGCTG CATGCCTATA CCGGCCTGCC CGCCGATTAT ATCCGCCGGG CCGACCTGCG GATCGACGGC GGCGAGTTTT CCAAGACCCT GCAATCGGAC AGCGACACGA CCACCGGGCG GCTGGATACC CGCTTCTCGG GCCCCAGCCT GGACCCGCTG AGCCAGGAGG CCGGGTATGA TCCGCAATCG ACCTCGATCA GTTCGGCGTA CGTGTCGCTG TTCAACGACT ATGTGCGCAA GGTCCTGAAT TATGGCGATG GGCTGGCCTA CCGGGCGGAA ATCGCCATCC CGCACTGGGA TTTCCACCAT GCCCCCGGCG GCCCCGACAC CCCGGCCTCG GACGGGCCCG CCAACGTCAT GCCGGACCTG GCACAGGCGC TGAAGACCAA TCCGACGCTG AAGATCATGC TGAACGCCGG ATATTTCGAC CTCGCCACCC CGTATTACGA GGGGATCTAC GAAATGCAGC ATCTGCCGGT CCCGGCAGCG CTGCAGAAGA ACATCGAATT CCAGCAATAT CAGTCCGGGC ACATGGTCTA TGCCAACGTG CCGTCCCTGA CGCAACTGCA TGACAATATC GCGGATTTCA TCCGGCGCAC GGACAACCAG GCCGGGCCGC AATAA
|
Protein sequence | MKTRLSLLLS ASLLCLGQMP ARADDRADAP HAVHAPAHAA PAVERTSSGT VTIGGKTIAY QAVAGTLLVH PQKWDDSDDI VHQETPGSDV HTTDGNPDAT ASMFYVAYFR KDARPESRPI TFVFNGGPGS SSVWLHMGAF GPRRVVTADD THTPASPYRL IDNPQSLLDA TDLVFIDAPG TGFGRITGKD KEKQFWGVDE DAQAFTSFVT QFLAKFGRYN SPKYLFGESY GTMRSAVLIN DLQNQENIDF NGVILLSQIL SYDNSVDQPT LNPGEDEPFV LALPTYAATA WYHHKLPQQP ADLKAFLAEV EHFASTDYLV ALQQGAALPQ AERARIIDRL HAYTGLPADY IRRADLRIDG GEFSKTLQSD SDTTTGRLDT RFSGPSLDPL SQEAGYDPQS TSISSAYVSL FNDYVRKVLN YGDGLAYRAE IAIPHWDFHH APGGPDTPAS DGPANVMPDL AQALKTNPTL KIMLNAGYFD LATPYYEGIY EMQHLPVPAA LQKNIEFQQY QSGHMVYANV PSLTQLHDNI ADFIRRTDNQ AGPQ
|
| |