Gene Gdia_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2017 
Symbol 
ID6975443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2238053 
End bp2240251 
Gene Length2199 bp 
Protein Length732 aa 
Translation table11 
GC content68% 
IMG OID643391546 
Productpeptidase 
Protein accessionYP_002276392 
Protein GI209544163 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTTACC TCGGCCGGGT AATTCCGTTA GCCTCGGCGC ACATGTTTCG TCTGCCCGGA 
AAGGTCATGA ATTTGTCGCA CGATTGTCCG CGCCCGCGCC GTCTCGTTCC CCCCGGGGCC
CGCTTCCGGG CCGGCGGGAT GATGCTGGGG GCGGTCCTGG CCTGCGGTGT GGCGTGGCCG
GCCATGGCGT CGGACGACGC GGCCGATGCC GCCCCGGCGG CGAGTGTCGC GGCGCCCCCG
CCGCCGGTGC TGCCCGCGCC GGACCCGGCG CGCATCTTCG CCCCCCTGGC CTATCCCTCG
CCGGTCAACG TCTATCGCTC CGGCGGCGGC GCCCCCGGTC CCGCGTACTG GCAGAATGGC
GCGGATTACG ACATCAAGGC GGCGATCGAC CCGGCCAACC GGCTGCTCTC GGGCTCCGAG
ACCATCACCT ACGTCAACAA CAGCCCCGAC ACGCTGGACG TGCTGTGGCT GCAACTGGAC
CAGAACATCT ATCGCGACGG GTCGCGCGGC AGCTTCGCCA ACCCCGAAAG GACCACGCAG
CATACGGACG GCGCGGCGAT CGAAAGCGTG GCGGTCGAAC AGGACGGCAA GAGCATCCCG
GTCACGCCGG TGATCTCGGA CACCCGGATG CAGGTCCCGC TGCCTGCCCC GCTGGACGGC
AAGGGCGGGA AGATCCGGCT GAAGATCGCG TGGCACTATA CCATTCCCGG GGAATGGGGC
GGCCGCACCG CCGTCAGCGC CAGCCGCAAC GGCGATATCT ACGAGGTCGC GCAATGGTAT
CCGCGCATGG CCGTCTATGA CGATATTCGC GGCTGGGACA CCGCGCCCTA TCTGGGCCAG
GAATTCTTCC TGGACTATGG CGATATCGAT TACAGCATCA CCGTGCCGTG GAATTTCACC
GTCGTGGGAT CGGGCGCGCT GCTGAACCCG GCCGACGTCC TGACCCAGAC CGAGCGCGAC
CGCCTGGCCC AGGCCGCCAA GAGCGACGAG CGGGTGATGA TCCGCACCCA GGCCGACGTC
ACCGACCCCA AGAGCCACCT GGCCCAGACG GGGACGAAGA CCTGGCATTT CCGCATGAAC
AACACCCGCG ACGTGGCCTT CGCCGCCTCG CCGGCCTTCC TGTGGGACGC GGCGCGGCTG
AACCTGCCGC CGCTGGCGGC GGAACCGGGC CGCGCGCCCG TGCCGCGCCT GGCGATGTCG
GCCTATCCGG TCGAAGGCAT CGGCGCGCAT CAATGGGACC GCTCCACGGC CTATGTGAAG
TTCGCGATCG AGAATTTCTC GCGCCGCTGG TACCCGTACC CCTGGCCCAA CGCGGTCAAT
CTGGGCGGTT ATGGCGCGGG GATGGAATAT CCCGGCATCG TCTTCGACGG GATGCACGAC
AAGGACGCGG AACTGTTCTG GATCACGACG CATGAGCTGG GCCATGACTG GTTCCCGATG
ATCGTGGGGT CCAACGAACG CCGCAACGCC TTCATGGACG AAGGCTTCAA CACCTTCATC
GACACCTACG CATCCGACGA TTTCAATTCC GGCGAATTCG CGCCGAAGCG CGATGCCGAA
TTCGCTCCCG CCACCGGCAA GCCCGCCGAC GACATCCTGA CGGTCCTGCG CGATCCCGCC
GCGCCGCGCC TGATGGCGCC CGCGGACACG ATATCCGAGA AGTACCGCCA CCCCGTCACC
TACTTCAAGG CGGCCTACGG GCTGAAGCTG CTGCGCGAGC AGATCCTGGG GCCGGAGCGC
TTCGACCGCG CCTTCCGCCG CTATATCGCG CTCTGGGCCT ATCACCACCC CACGCCGTCG
GACTTCTTCC GCCTGATGGA CAGCGAGGCC GGCGAGGACC TGTCCTGGTT CTGGCGCGGC
TGGTACTTCA ACAACTGGGC CCCGGACTAT GCCGTGAAAT CCGCGACGCT GACCGGCGAC
GGCCCGACGC GCGCGGTGCT GGTGACCGTG GACAACAAGG GCTGGCTGCC CCTGCCGGTC
ACGCTGGTGC TGACCTATAC CGACGGCAGC ACCTCGCGCC TGACCATCCC GACCGAAACC
TGGCAGTTGC GCAACGAGGT CGAACTGACG ATTCCCACCG CCCGCACGCC CCAGTCGGTG
ACGCTGGACC CGGGGCATGA GATCCCCGAC CTGAACCGGG CGGACAACAC GCTGACCCTG
GCCCCGCCCG CCCCCGCGCC GGCCGCCCCG GCGCACTGA
 
Protein sequence
MAYLGRVIPL ASAHMFRLPG KVMNLSHDCP RPRRLVPPGA RFRAGGMMLG AVLACGVAWP 
AMASDDAADA APAASVAAPP PPVLPAPDPA RIFAPLAYPS PVNVYRSGGG APGPAYWQNG
ADYDIKAAID PANRLLSGSE TITYVNNSPD TLDVLWLQLD QNIYRDGSRG SFANPERTTQ
HTDGAAIESV AVEQDGKSIP VTPVISDTRM QVPLPAPLDG KGGKIRLKIA WHYTIPGEWG
GRTAVSASRN GDIYEVAQWY PRMAVYDDIR GWDTAPYLGQ EFFLDYGDID YSITVPWNFT
VVGSGALLNP ADVLTQTERD RLAQAAKSDE RVMIRTQADV TDPKSHLAQT GTKTWHFRMN
NTRDVAFAAS PAFLWDAARL NLPPLAAEPG RAPVPRLAMS AYPVEGIGAH QWDRSTAYVK
FAIENFSRRW YPYPWPNAVN LGGYGAGMEY PGIVFDGMHD KDAELFWITT HELGHDWFPM
IVGSNERRNA FMDEGFNTFI DTYASDDFNS GEFAPKRDAE FAPATGKPAD DILTVLRDPA
APRLMAPADT ISEKYRHPVT YFKAAYGLKL LREQILGPER FDRAFRRYIA LWAYHHPTPS
DFFRLMDSEA GEDLSWFWRG WYFNNWAPDY AVKSATLTGD GPTRAVLVTV DNKGWLPLPV
TLVLTYTDGS TSRLTIPTET WQLRNEVELT IPTARTPQSV TLDPGHEIPD LNRADNTLTL
APPAPAPAAP AH