Gene Acid345_1921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1921 
Symbol 
ID4071032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2309746 
End bp2310777 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content63% 
IMG OID637983933 
Producttransposase IS116/IS110/IS902 
Protein accessionYP_590996 
Protein GI94968948 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.876079 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCG TAGCGGTCGG TATCGATCTC GGTAAAACCG TGTTTCACCT GGTGGCGATG 
GGAGAGCGCA ACCGCGTGCT GGTGCGGCGG AAGTTCTCGC GTCAACAGCT GTTGGCTTAC
ACGGCCAACC TCGAGCCCAC TTTGATCGGC ACCGAAGCCT GTGCCGGCGC CCATTTTCTC
GCGACAGCCC TATGTGCGCA GGGACACGAC GTGCGCCTCA TGGCAGCCCA GTTCGTGAAG
CCTTACCGTA AGTCGAACAA GAGTGACTTT CTCGATGCCG AGACGATCGC CGACGCGGTG
CAGAAGGAGA ACATGCGCTT CGTGCCAGTC AAGACCGACG AGCAACTCGA TCTGCAGGCT
ATGCACCGCG TGCGCACTCG CCTGGTGCAG CGGCGCACGG CACTGATCAA CGAGATCCGC
GGGTTCCTGC TGGAGCGCGG CATCATCTTT CCCGCGAAGC CGATTCACCT GCGCAAGCAA
CTTCCGGGTG TACTGGAAGA CGCGACCCAG AACCTGACGC CGAGGCTGCG CTGGCTGCTC
TCTGAACTTG CGGAGGAGTG GAAGGAGTTG GAAGCTAGGA TCATCGCTAT CAGCGACGCC
ATCGAGCGGA TCAGCACCAG CGATCCACTC TGCCAGCGTC TGCGCCAGAT CCCAGGCTTC
GGGCCGCTGG TTTCGACAGC AACCGTGGCC GCTATCGGCA ACGGGTCGTC GTTCCGCAAG
GGTCGCGACT TCGCGGCGTG GCTCGGTGTT GTTCCCCGAC AGTACTCCAC GGGTGGCAAG
ACGGCGCTCT ACGGCATGAG CAAACGCGGC AACCGTTATC TACGACAGCT GCTGATCCAT
GGCGCGCGTG CTGTCCTGAT CCGGGTGAAG TACGACACCG CAGGGTTGGG GCAGTGGATC
CACAAGCTGG CCGAGCGTGC ACCGCGCAAC AAGGTGATCG TCGCGATCGC CAACAAGCTG
GCGCGTATCG CCTGGGCGGT ACTCGCGAAG GGTGAGCCTT ACCGCCATCA GCCCTTGGCG
GCCGCAGCGT AG
 
Protein sequence
MRIVAVGIDL GKTVFHLVAM GERNRVLVRR KFSRQQLLAY TANLEPTLIG TEACAGAHFL 
ATALCAQGHD VRLMAAQFVK PYRKSNKSDF LDAETIADAV QKENMRFVPV KTDEQLDLQA
MHRVRTRLVQ RRTALINEIR GFLLERGIIF PAKPIHLRKQ LPGVLEDATQ NLTPRLRWLL
SELAEEWKEL EARIIAISDA IERISTSDPL CQRLRQIPGF GPLVSTATVA AIGNGSSFRK
GRDFAAWLGV VPRQYSTGGK TALYGMSKRG NRYLRQLLIH GARAVLIRVK YDTAGLGQWI
HKLAERAPRN KVIVAIANKL ARIAWAVLAK GEPYRHQPLA AAA