Gene Gdia_1799 details


Gene Information

Locus tag: Gdia_1799
Symbol:
ID: 6975220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1993000
End bp: 1994595
Gene Length: 1596 bp
Protein Length: 531 aa
Translation table: 11
GC content: 56%
IMG OID: 643391323
Product: integrase family protein
Protein accession: YP_002276174
Protein GI: 209543945
COG category:
COG ID:
TIGRFAM ID:
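
The coordinate and length fields above are related by simple arithmetic: the gene length is the inclusive span from start to end, and the protein length is the codon count minus the stop codon. A minimal Python sketch of that check follows. The function names are illustrative and not part of the record, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(1993000, 1994595)
print(length, protein_length(length))  # 1596 531
```

Both values agree with the Gene Length and Protein Length fields of this record.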


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0370566
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.65053
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTCCGCG CCATCCCGTC CCGACGCACA GGACAGCCCC GTTATCACTT CCGCCGTATC 
GTCCCGGCAA TGCTCCGCCC CCTGCTGGGC AAGACCGAGA TTTCCCTCGT CCTCCATACC
TCTGATAAAC TGGTCGCCCG TGAACGTGCA GCCGCCCTGT ATGCCAGGAC GGGACAGCTT
TTCAGAGCAG GCAAGATGCA AAACCCCTCC AGAGAGGACC TTCTGGTCCT CTACAGCGAA
TTGATTGAAA ACTACGAAAT CGCTCTGAAG GCTCAGGAGG AGAGTGCCAG AAAAGTGCTG
GAAGCCGAAC GGGCCCAGCA TCTCATTGAA AAAGCCGATC TGGTCTCACG ACAGTCGACC
TTCCTCAACA AGGTCCAGCC CCACATGGAC AGCCTTCTTC AGGCACTGGG GCAACTGAAG
CACACGCTCG ACAGACGGGA TGTCTGGAAC GTCGCTGAAA AACAGGGACT GCAGAACCAG
ATCACCACCC TCAGCACCCT GATCCAGACC AGCCTGCATA GAGGTTCAGA GAACCCGGAA
GAACAAACAC CTTCCCCCAA ACAGCCACGC AGCATGAAAC TGTCGGCGGC GGCTGAGAAA
TTTGTCTTCT CGGTTTCCTC CAAAAGTGCC GGGACCATCA AGGGAACGGG CAAGACCGTA
GCCCTCTTCA CCGAAGCCTT CGGCGATATG CCAGTTCATC AGGTAACCGG GGAAGTCGTG
GGAGAATTTT ACGACCTACT GTCAGGCCTC CCGGCAACAC ATGGCAAAGG CAAGACCACT
CTTCCCCTTC GGGACGCGGT CCGAGAGGCC CAAAAAAGCG AGGGAGAAAC CGTATCGGGC
AAAACCGTCA AAAACCATTT CTCGCGCATG TCCGCGATCT GGAGTGAACT GGTCCGGCGT
GACCACGCCC CCAAAAATCC ATGGGCCAAC TGGTCTTTCG ATCTGACGCA GAAAACCAAC
CGCCGGGCCT GGTTGGAAGA GGAACTGAGA ACACTCCTGA ACTCGAAATG GCTGGGCCGG
GTTTTCCCCG AACGCACCTA TCGGGGCATT GTGCGCATAG CCCTCTATAC CGGCATGCGT
CTGGGTGAGA TCTGCAATCT GAGAAATCAG GATATCGAAA CCCTGAACGG CATTCCCTGT
TTCCATATCC GGCCCCATAC AGTCGAGATC GACGGCAAAA CCCGGGAATG GTCTCCCAAG
ACATCAGCCG GCACCCGTAT CATTCCCATC CACAGCAAAC TGCTGGAAAA AGGGATCATT
GAGGAATTCA GAAATTCCGG CCCCTATCTC TTCAGCGAAC TGCCCATCTC CGCTTCCGGG
GTCCGAGGCG CAAACTTCGA AATGGTCTTT TCCAAACATA AACGGCGTCT GAACCTGCCA
GCGGACGTCA CCTTTCACTC CTTCCGCCAT CTGGTTTCGA CAGTGCTCAG AAACCAGGAC
AGTCACATCC GGGAACTCTG GATTGATGAT CTGCTGGGTC ATGAAGCCAC CCACAAAAGT
CAGGGCACAA CCCAATACAC GTCAGCTATT GATCTGCAGA ACCTCCAGCG GGTTGTGGAG
GCCATTACCT ATCCCGACGA CATCGCAAAC TGGTGA
 
Protein sequence
MLRAIPSRRT GQPRYHFRRI VPAMLRPLLG KTEISLVLHT SDKLVARERA AALYARTGQL 
FRAGKMQNPS REDLLVLYSE LIENYEIALK AQEESARKVL EAERAQHLIE KADLVSRQST
FLNKVQPHMD SLLQALGQLK HTLDRRDVWN VAEKQGLQNQ ITTLSTLIQT SLHRGSENPE
EQTPSPKQPR SMKLSAAAEK FVFSVSSKSA GTIKGTGKTV ALFTEAFGDM PVHQVTGEVV
GEFYDLLSGL PATHGKGKTT LPLRDAVREA QKSEGETVSG KTVKNHFSRM SAIWSELVRR
DHAPKNPWAN WSFDLTQKTN RRAWLEEELR TLLNSKWLGR VFPERTYRGI VRIALYTGMR
LGEICNLRNQ DIETLNGIPC FHIRPHTVEI DGKTREWSPK TSAGTRIIPI HSKLLEKGII
EEFRNSGPYL FSELPISASG VRGANFEMVF SKHKRRLNLP ADVTFHSFRH LVSTVLRNQD
SHIRELWIDD LLGHEATHKS QGTTQYTSAI DLQNLQRVVE AITYPDDIAN W
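
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The Python sketch below is one way to do that, assuming the gene sequence is read as printed, in frame from its first base, with the grouping spaces removed. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in which start codons are permitted), so a standard codon table is enough here. The `translate` helper and the table construction are illustrative and not part of this record.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTCCGCG CCATCCCGTC CCGACGCACA"))  # MLRAIPSRRT
```

Passing the full gene sequence to `translate` in the same way allows a comparison with the complete protein sequence listed above.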