Gene Gdia_0543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0543 
Symbol 
ID6973940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp600274 
End bp601515 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content47% 
IMG OID643390076 
Productintegrase family protein 
Protein accessionYP_002274952 
Protein GI209542723 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000277419 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.0534863 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAAGC ATAGTTCTAG CAGCGTGGCG GGAGCGTTTA ATTCGCAGCC ATGGTCCATC 
TACAAGCAGA CAAACTCAAA AAACTGGTTC GTCCGCTTCT CGATCAAGGG ACAGGGGCAG
ATTCGAAAGT CCCTTCAGAC CGCAGATGAG AAGGAAGCAG ATCGTAAGGC TGCACGGATC
TATTATGAGG CTCTCTTACG TGCAGAGCAG GGCTTGGAAG CCAGAGACAA GACTGTAAGG
GTCCTGGCAG ATGAGTGGAT CGCAGAGGGC GTCCTGAAGG CTCCTGAACG GACTGCTCTC
TCCCGATATA TCGTGGGATA CTTCGGGGAC GATAAGCCAT CAGACATCAC CTACAGGGGG
CTTCAGGCGT ATCGGAAATG GCGCTTAAGC TATTGGACAG AAGGGCCTGG TAAGAATCTG
GAGATGGTCG TCTATCAGCG ACTCGGGCGA CAGATTCGTA GGCCGGTGAC GAGAAAGACC
CCTACCCGCT CCACACTCAA CAGTGAGCAG GTCGTCTTTA AAAAATTCTT AACCCGATGT
CAGAATCTCG GGCACATAAA AACAATTCCC AAGTTTGATA AGATTGAAGG TGAAGTTCAC
AATCGGCCTG GGTTTTCTCA TGCCGAGATT GAAAAAATCT TGTCAGTCCT GAGAAAGAGG
ACCGTCCTCC CCGCTCTATC CAATGAGGAA CGTTATCCAC ATATTCTCCT CTATGGATAT
GTCGGAATTA TGTGTGGTTC GGGGATGCGC CCCATTGAGT GTCAAAAACT CAGATGGATC
GATCTAGTTG GTTTCGATGA ATCTCGAAAC GCTAAACTTT GTGAAGGTCG AATTACCGTC
CGGGTTCATG GCAAAGGTAA ATCAAGAGAG TTTGTACCTC TTGATGGGAC AATCTCTGAT
TTTCTAATGA TATGGGATGT TCAGAAAATA ATACGGGGAT CTGATCCAGA TCCCAATGAT
TATATCTTTG TTGACATCAA GGGTAGGCAT ATCCAGACCT TTAACCCAGA AGTAGTATCA
CTTCTGGAAG AATGCTCTCT GCGACAGGAT TACCGAGGAA TAAAGAGAAC ATCTTATTCG
TTCAGACACT ATTATATTAC GTTCATGATT AACGCACATG CGAACATATA CGATATCGCC
AAAAACTGCG GCACATCAGT AGCTATGATT GAAAAGTTCT ATTCACATGT CACGCTTGAA
TCGATACGAG ACAGACTGCG GCCTTCTGGA ACGCGGATCT AG
 
Protein sequence
MGKHSSSSVA GAFNSQPWSI YKQTNSKNWF VRFSIKGQGQ IRKSLQTADE KEADRKAARI 
YYEALLRAEQ GLEARDKTVR VLADEWIAEG VLKAPERTAL SRYIVGYFGD DKPSDITYRG
LQAYRKWRLS YWTEGPGKNL EMVVYQRLGR QIRRPVTRKT PTRSTLNSEQ VVFKKFLTRC
QNLGHIKTIP KFDKIEGEVH NRPGFSHAEI EKILSVLRKR TVLPALSNEE RYPHILLYGY
VGIMCGSGMR PIECQKLRWI DLVGFDESRN AKLCEGRITV RVHGKGKSRE FVPLDGTISD
FLMIWDVQKI IRGSDPDPND YIFVDIKGRH IQTFNPEVVS LLEECSLRQD YRGIKRTSYS
FRHYYITFMI NAHANIYDIA KNCGTSVAMI EKFYSHVTLE SIRDRLRPSG TRI