Gene Acid345_3751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3751 
Symbol 
ID4069326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4426717 
End bp4427763 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content64% 
IMG OID637985773 
Producttransposase IS116/IS110/IS902 
Protein accessionYP_592825 
Protein GI94970777 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATTA TAGGCTGCGA TTTGCATGCA CGACAGCAGA CGATTGCGAT GCTGGACGTG 
GAAACGGGCG AACTCGAAGA GCGCCGCCTT GAGCATGAAG GCGACCAGGT GCGGGCGTTC
TATGCGGGAC TACCGCGACC GGTGCGGGTG GGCATAGAAG CGACCGGCGC GATGCAGTGG
TTTCTCGAGT TGATGGAAGA GCTCGAAGTG GAGTGCGTGG TTGGACATCC GGCGCAGATC
CGTGCCGGGG AGCCGCGCAA ACAGAAGAAC GACCGTCGCG ACGCGCGATT GTTGCTGCGG
CTGCTGGCGG AGAACCGGTT CCCGAAAATT TGGATGCCCT CGATGGAGCA GCGCGACGTG
CGCGCCCTGC TGCGACATCG TGAGCAGTGG GTGCGGATTC GCGTGCGGCT GCAGAATGCG
TTGCAGTCGA TCGCGCTAAC GCATGGGATT CGCCGCGGCG CGAGTTTATG GAGCAAGGCC
GGGCAGCGGA CGATAGGCGA GCTAGCGTTG CCGCCGCTCA GCGGCATGCG GCGCGACGAA
CTGATGCGGT TGCGCGTGCA ACTGCATCAG GAGATCGAGC GGCTGGATAA AGAAGCCGAG
CGCGTGGCAC GAGAGCGTCC GCTGGCACGG CTGTTGATGA CGCATCCGGG CGTAGGGCCG
ATCACATCGC TGGCGACGGA GGTGTTCCTC GGCGATGCAA CCCGCTTCGC AGATGGCAAG
GCAGTGGCCA GCTACGTGGG CATGATCCCG TGCGAGCACT CGAGCGCGGG CAAGCAGCGC
CTGGGCAAGC TCACGAAACA AGGCAACGCC ATGCTGCGCA TGCTGTGGTG CGAAGCGGCG
CGACATGCCG TGATCAAAGA CGAAGATCTA CGGCGCTTCT ACCGTCGCAA GCTGGCGCAG
AAGGGCCTGG GGATCGCGCT GGTAGCAGCC GCACGCAAGC TGGGAGTGCG GTTGTGGATC
ATGCTGCGCG AAGAAATCGA CTACGAAGAG TTCTGCCGTC GCGGACGGCA GCGCGGGGAA
GCCCAAGCGG AGAGGCCAGC TCGATAA
 
Protein sequence
MRIIGCDLHA RQQTIAMLDV ETGELEERRL EHEGDQVRAF YAGLPRPVRV GIEATGAMQW 
FLELMEELEV ECVVGHPAQI RAGEPRKQKN DRRDARLLLR LLAENRFPKI WMPSMEQRDV
RALLRHREQW VRIRVRLQNA LQSIALTHGI RRGASLWSKA GQRTIGELAL PPLSGMRRDE
LMRLRVQLHQ EIERLDKEAE RVARERPLAR LLMTHPGVGP ITSLATEVFL GDATRFADGK
AVASYVGMIP CEHSSAGKQR LGKLTKQGNA MLRMLWCEAA RHAVIKDEDL RRFYRRKLAQ
KGLGIALVAA ARKLGVRLWI MLREEIDYEE FCRRGRQRGE AQAERPAR