Gene Gdia_2501 details

Gene Information

Locus tag: Gdia_2501
Symbol:
ID: 6975930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2751044
End bp: 2752090
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 62%
IMG OID: 643392018
Product: integrase family protein
Protein accession: YP_002276860
Protein GI: 209544631
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
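
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based positions and that the final codon of the CDS is a stop codon; the record does not state either convention explicitly, but both are consistent with the listed 1047 bp and 348 aa. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2751044
END_BP = 2752090


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1047 348
```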


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.137002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.0538174
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCCGCA AAAGACCAGA TCGGCCATTT CTGGAACTCT ATCGAGGGAC GTGGTGCGTC 
GTCTGGTGGG AGAGCGGAGA GCGCAAGCGA AGCTCGACGG GTACTGCGGA TGAAGAGGGC
GCTCGGCGCG CTCTAGCCGA CTTCGAAGCG GCGCTGTCAG CACGTCCGAA CGGTCAGCTT
CTATCTGAGG CGCTTGATAT TTACGTTTCG GCCCGGGCTG GGAAGGTGAC GGCCCTCAGC
CGCCTCGAAG AGGCGGCTAT TCGCATCAAT GAGGGAATGG GGCATCTTCG GATCAACCAG
ATCCATCAGC GCCAATGGGA CGATTACGCA GCAAGCCGCT TTCGCAAGCC GAATGCGCGG
AGCAAGCGCC CAGTCGAGGG GGCGCCCGTC CCGATATCGC TCGGAACCCT GAAGCGGGAA
TTCAACGTTC TACGCGCGGC GCTGCGTCAC GCCTGGCGTA ATCACAGGCT CGACAAGCCG
CCGACTTTGG AGGGGCCGGG AGGCAGCGCG CCGCGCGATC GCTACATCAC CAAGGCCGAG
GCTCGCCGCC TTTTGGACGC TTGCGAGACG CCGCATATCC GCGCGTTTCT GGCGCTGGCG
ATGTTCACGG GCGCGCGAAA GGGATCGATT CTCGCTCTCA CTTGGGATCG GGTGATGTTC
GATCTGGGTC GCATCGACTT CCAGGAACCT GGGCGGAAGT TGACGGCCAA GCGCCGTGCA
ATCGTCCCGA TGACGGATGA CCTGCGGGCA GAATTGACCG AGGCGCACAA GGTCCGGACA
TGCGACTATG TGGTCGAATG GGCTGGAGGT CCCATCACCT ATGGCATCCG CTGGCCATTG
AAAAAGTTGG CGCAGAAGGC TGGTCTGTCA TGGACGCCCA CGCCCCATCA CTTCAAGCAC
AGTGTGGCGT CATGGATGGC CATGGCCAAG GTGCCTATTG ATCAGGCGGC CGACTGGCTT
GCCACCGATC CCAAGACGCT GCGTCGAGTC TACCGGAAAT TCGATCCGGA TTATCTGCGG
GAGGTAGGGT CTGCCCTGAA ACTATAG
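
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is used. The following is a minimal sketch of that clean-up together with a GC-content calculation of the kind summarised in the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input; the full sequence would be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, as sample input only.
first_row = "ATGCCCCGCA AAAGACCAGA TCGGCCATTT CTGGAACTCT ATCGAGGGAC GTGGTGCGTC"
cleaned = clean_sequence(first_row)
print(len(cleaned), round(gc_percent(cleaned), 1))
```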
 
Protein sequence
MPRKRPDRPF LELYRGTWCV VWWESGERKR SSTGTADEEG ARRALADFEA ALSARPNGQL 
LSEALDIYVS ARAGKVTALS RLEEAAIRIN EGMGHLRINQ IHQRQWDDYA ASRFRKPNAR
SKRPVEGAPV PISLGTLKRE FNVLRAALRH AWRNHRLDKP PTLEGPGGSA PRDRYITKAE
ARRLLDACET PHIRAFLALA MFTGARKGSI LALTWDRVMF DLGRIDFQEP GRKLTAKRRA
IVPMTDDLRA ELTEAHKVRT CDYVVEWAGG PITYGIRWPL KKLAQKAGLS WTPTPHHFKH
SVASWMAMAK VPIDQAADWL ATDPKTLRRV YRKFDPDYLR EVGSALKL
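
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the tool that produced this record. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code, and it does not model alternative start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional T, C, A, G codon ordering; "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCCCCGCA AAAGACCAGA TCGGCCATTT"))  # prints: MPRKRPDRPF
```

Passing the last row of the gene sequence (`GAGGTAGGGT CTGCCCTGAA ACTATAG`) through the same function yields `EVGSALKL`, which matches the end of the protein sequence; the trailing TAG is the stop codon.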