Gene Gdia_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1034 
Symbol 
ID6974431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1163972 
End bp1165027 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content61% 
IMG OID643390556 
Productputative replication protein 
Protein accessionYP_002275432 
Protein GI209543203 
COG category[L] Replication, recombination and repair 
COG ID[COG5534] Plasmid replication initiator protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.135734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCCGA CGGGCAGAAC GCGCTCCGAG CGCGAGCAAC TGGAACTCTT CCACGCTATC 
GCGGGAGATT TCGCGCCTCG CGATGCGCAG GATCTGATGG CGTTTCCGTT CTTCAGCCTC
GCCAAATCCC CTCGCATGGT CCCGATCGAT TATCGGACCC CCGATGTCAC GATCCGCGTC
GAGGCGTCCG CCGAACATGG CATGGCCACG ATCTGGGATG CCGACGTGCT GATCTGGGCC
GCCAGCCATC TCGTTGCCGC GCGCGACGCC GGTCGGCGCA CGTCGCGGCT GATGATGGCC
AGCCCGCGCG AGATCCTGAC CTTCATCGGT CGGGGCGACA GCGCGCGGGA CTATGAGCGG
CTGGAAGCAG CCTTCGATCG GCTGCAATCC ACCACGATCA AGACCTCGCT GCGGCAGACC
GGCAAGGGGC AACTGCACCG CTTTTCCTGG ATCAACGAAT GGAAGCGACA TACCGCGCGG
GAAGGCCGCA CCCGCGTGAT CGAACTGATC CTGCCCGACT GGTTCTACCA GGCGGTGCTC
GATGACGCGC TCGTTCTGAC CATCGATCCG GCCTATTTTA ACCTCACCGG CGGTCTGGAG
CGCTGGCTAT ATCGCATCGT GCGCAAGCAT GGTGGTCGTC AGCGCGCGGG CTGGGCCTTC
GGCCTTCGCC ATCTCTACGA AAAATCCGCC AGCCTTTCCC CCTATCGCCG CTTTGCCTTC
GAACTGCGCG AGATGGCGAA ACGGCAGCCC TTTGCCGGCT ATCGGCTGTC GGTGCGCCCC
GACCGCAACG GCAATGACTC GCTGGCCTTT GCACCTGTCA AACTATCCAC AGGCGCCTGT
GGACAAGCTG TGAATTCATC CGTGCTATCA GTTGTGGATT TATCCGTGCC ATCACTGCCA
CCGCATCCGT GCTATCGTTT GCGGAAAACG CCGAATCACA ACATTGAATC AAGTGGTTAT
GACGCCCTTA ACTTAGAATC TAACTTAAAA GAGTCTAACT TTAAGGATGT TGGCCCCCCC
GCCGATCCGT GGATAAGCCC CGGAGAGGGG TCATGA
 
Protein sequence
MSPTGRTRSE REQLELFHAI AGDFAPRDAQ DLMAFPFFSL AKSPRMVPID YRTPDVTIRV 
EASAEHGMAT IWDADVLIWA ASHLVAARDA GRRTSRLMMA SPREILTFIG RGDSARDYER
LEAAFDRLQS TTIKTSLRQT GKGQLHRFSW INEWKRHTAR EGRTRVIELI LPDWFYQAVL
DDALVLTIDP AYFNLTGGLE RWLYRIVRKH GGRQRAGWAF GLRHLYEKSA SLSPYRRFAF
ELREMAKRQP FAGYRLSVRP DRNGNDSLAF APVKLSTGAC GQAVNSSVLS VVDLSVPSLP
PHPCYRLRKT PNHNIESSGY DALNLESNLK ESNFKDVGPP ADPWISPGEG S