Gene Gdia_3304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3304 
Symbol 
ID6976744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3611033 
End bp3612667 
Gene Length1635 bp 
Protein Length544 aa 
Translation table11 
GC content65% 
IMG OID643392815 
Productpeptidase S10 serine carboxypeptidase 
Protein accessionYP_002277646 
Protein GI209545417 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2939] Carboxypeptidase C (cathepsin A) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.169507 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0135899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGACAC GCCTGTCGCT TTTGCTATCG GCCTCGCTGC TATGCCTGGG CCAGATGCCC 
GCCCGGGCCG ATGACCGCGC CGATGCGCCC CATGCCGTCC ACGCCCCCGC CCATGCCGCG
CCGGCGGTGG AGCGGACCAG CAGCGGAACC GTGACGATCG GCGGCAAGAC CATCGCCTAT
CAGGCCGTGG CGGGCACCCT GCTGGTACAT CCGCAGAAAT GGGACGATTC GGACGATATC
GTCCATCAGG AGACCCCGGG CAGCGACGTC CACACCACCG ACGGCAACCC CGACGCCACG
GCGTCGATGT TCTATGTCGC CTATTTCCGC AAGGATGCCC GGCCCGAATC CCGGCCCATC
ACCTTCGTCT TCAACGGCGG GCCGGGATCG TCCTCGGTGT GGCTGCACAT GGGCGCGTTC
GGTCCGCGCC GGGTGGTGAC CGCCGACGAT ACCCATACCC CCGCCAGCCC CTACCGGCTG
ATCGACAATC CGCAGAGCCT GCTGGACGCG ACCGACCTGG TCTTCATCGA CGCGCCCGGC
ACCGGCTTCG GCCGGATCAC CGGCAAGGAC AAGGAAAAGC AGTTCTGGGG CGTGGATGAG
GACGCACAGG CCTTCACCAG CTTCGTCACC CAGTTCCTGG CGAAATTCGG CCGCTACAAT
TCGCCCAAAT ACCTGTTCGG CGAAAGCTAC GGCACCATGC GGTCCGCCGT GTTGATCAAC
GATCTGCAGA ACCAGGAAAA CATCGATTTC AACGGCGTGA TCCTGCTGTC GCAGATCCTC
AGCTACGACA ACAGCGTCGA CCAGCCGACG CTGAATCCCG GCGAGGACGA ACCCTTCGTC
CTCGCCCTGC CGACCTATGC CGCCACGGCG TGGTACCATC ACAAGCTGCC GCAGCAACCC
GCCGACCTGA AGGCCTTCCT GGCCGAGGTC GAGCATTTCG CCAGCACGGA CTATCTGGTG
GCCCTGCAGC AGGGCGCGGC GCTGCCGCAG GCCGAGCGCG CGCGCATCAT CGACCGGCTG
CATGCCTATA CCGGCCTGCC CGCCGATTAT ATCCGCCGGG CCGACCTGCG GATCGACGGC
GGCGAGTTTT CCAAGACCCT GCAATCGGAC AGCGACACGA CCACCGGGCG GCTGGATACC
CGCTTCTCGG GCCCCAGCCT GGACCCGCTG AGCCAGGAGG CCGGGTATGA TCCGCAATCG
ACCTCGATCA GTTCGGCGTA CGTGTCGCTG TTCAACGACT ATGTGCGCAA GGTCCTGAAT
TATGGCGATG GGCTGGCCTA CCGGGCGGAA ATCGCCATCC CGCACTGGGA TTTCCACCAT
GCCCCCGGCG GCCCCGACAC CCCGGCCTCG GACGGGCCCG CCAACGTCAT GCCGGACCTG
GCACAGGCGC TGAAGACCAA TCCGACGCTG AAGATCATGC TGAACGCCGG ATATTTCGAC
CTCGCCACCC CGTATTACGA GGGGATCTAC GAAATGCAGC ATCTGCCGGT CCCGGCAGCG
CTGCAGAAGA ACATCGAATT CCAGCAATAT CAGTCCGGGC ACATGGTCTA TGCCAACGTG
CCGTCCCTGA CGCAACTGCA TGACAATATC GCGGATTTCA TCCGGCGCAC GGACAACCAG
GCCGGGCCGC AATAA
 
Protein sequence
MKTRLSLLLS ASLLCLGQMP ARADDRADAP HAVHAPAHAA PAVERTSSGT VTIGGKTIAY 
QAVAGTLLVH PQKWDDSDDI VHQETPGSDV HTTDGNPDAT ASMFYVAYFR KDARPESRPI
TFVFNGGPGS SSVWLHMGAF GPRRVVTADD THTPASPYRL IDNPQSLLDA TDLVFIDAPG
TGFGRITGKD KEKQFWGVDE DAQAFTSFVT QFLAKFGRYN SPKYLFGESY GTMRSAVLIN
DLQNQENIDF NGVILLSQIL SYDNSVDQPT LNPGEDEPFV LALPTYAATA WYHHKLPQQP
ADLKAFLAEV EHFASTDYLV ALQQGAALPQ AERARIIDRL HAYTGLPADY IRRADLRIDG
GEFSKTLQSD SDTTTGRLDT RFSGPSLDPL SQEAGYDPQS TSISSAYVSL FNDYVRKVLN
YGDGLAYRAE IAIPHWDFHH APGGPDTPAS DGPANVMPDL AQALKTNPTL KIMLNAGYFD
LATPYYEGIY EMQHLPVPAA LQKNIEFQQY QSGHMVYANV PSLTQLHDNI ADFIRRTDNQ
AGPQ