Gene Gdia_1661 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1661 
Symbol 
ID6975077 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1851749 
End bp1852888 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content60% 
IMG OID643391194 
Producttransposase IS110 family protein 
Protein accessionYP_002276051 
Protein GI209543822 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCCG AGAACGCCAT TCACATTGCC ATCGAACTCA GCGTTTCCTC CTGGCTTGTC 
GCGGTAAAGA CTATCTCGGG AGCCACGAAA TCCCGATTGC ATCGCCTCGA AGGTGGAGAC
GCCTCAGGGC TGCTGAAATT GATCGCGGAG CTTCAAACGC GCGCGTCGAC CCAGCCGGGT
GATGTCGCGG AGGTGTCATG CTGTTTCGAG GCCGGCCGCG ATGGCTTCTG GCTGTATCGT
TTGCTGACAG CGCACGGCAT CGCCGCGTAT GTGCTTGAGC CCACGAGCAT TCTGGTCAAT
CGTCGCGCAC GTCGGGCCAA GACGGACCGT CTCGACGCGG AAGGCATGCT GCGTGTTCTT
GCGGCATGGC TTAATGGTGA TCGCCAGATA TGCAGCATGG TGCGTGTGCC GACGCCCGAC
GAGGAGGATG CCAAACGTAC ACACCGCGAA CGCGAACACC TTGTTCAGGA AAGGCTGCGT
ATCGAAAACA GAATAGAGGC GCTGCTGTTT ACCCAGGGCA TCCGGGGTAG ACCGTCGTTA
CGGTCCTGGG AACGCGACGT CGCCGCGTTG CGCACGGGCG ACGGGCGGGA ACTGCCGCCG
TTCCTTCGTG CTGAACTCGA CCGCCTGCGT CGTCGGCTTC TCCTGGCGTT GGAACTGATC
CGAGAACTGG AAACTGAACG GGCCAAGACA CTGGACGCCG CAGCGATGGA TGACCGTGTG
ACTCAAAAGA TCGTCTCGCT GAAACAGATC CGCGGCATCG GCGAGAATTT CGCTGCCGTT
CTCGTTCGGG AGGTGTTCTA TCGCCGCTTC GACAACCGTC GCCAACTGGC CAGTTACGTC
GGCATTACGC CTATGCCTTA TCAAAGTGGC AGCATGGATC GTGATCGAAG CATCAGCCGG
GCCGGAAACC CGCGAGCGCG GACGGCGATG ATCCAACTCG CCTGGCTTTG GCTACGCTAT
CAGCCCGCAA GCGGGCTCGC CTCATGGTTT CGTGAGCGCG TCGGCACCTT GAAAGGGCGG
ACACGCCGCA TTGCGATCGT GGCCATGGCG AGAAAGCTTC TGATTGCGCT TTGGCGCTAT
GTGGAGACAG GATCGATACC GGACGGTCTC GCATTCGGCA CCGGAACGAC CGCAGAATAG
 
Protein sequence
MSSENAIHIA IELSVSSWLV AVKTISGATK SRLHRLEGGD ASGLLKLIAE LQTRASTQPG 
DVAEVSCCFE AGRDGFWLYR LLTAHGIAAY VLEPTSILVN RRARRAKTDR LDAEGMLRVL
AAWLNGDRQI CSMVRVPTPD EEDAKRTHRE REHLVQERLR IENRIEALLF TQGIRGRPSL
RSWERDVAAL RTGDGRELPP FLRAELDRLR RRLLLALELI RELETERAKT LDAAAMDDRV
TQKIVSLKQI RGIGENFAAV LVREVFYRRF DNRRQLASYV GITPMPYQSG SMDRDRSISR
AGNPRARTAM IQLAWLWLRY QPASGLASWF RERVGTLKGR TRRIAIVAMA RKLLIALWRY
VETGSIPDGL AFGTGTTAE