Gene Acid345_3757 details

Gene Information

Locus tag: Acid345_3757
Symbol:
ID: 4069332
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4439377
End bp: 4440687
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 52%
IMG OID: 637985779
Product: restriction modification system S subunit
Protein accession: YP_592831
Protein GI: 94970783
COG category: [V] Defense mechanisms
COG ID: [COG0732] Restriction endonuclease S subunits
TIGRFAM ID:
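
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. Neither assumption is stated in the record, although both are consistent with the numbers listed. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 4439377
END_BP = 4440687
GENE_LENGTH_BP = 1311
PROTEIN_LENGTH_AA = 436


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1311
print(coding_codons(GENE_LENGTH_BP))   # 436
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields.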


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.386531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTGGC CTAAGTCCAC CGTTATGGAA CTTCAGCGGG ACGGGGTCCT GCTCGTCGAA 
GATGGTAACC ACGGCGAAAG TCGCCCCCGC CCCGATGAGT TTGTTAAGCG TGGGGTGGCT
TTTATCCGAG CAGCCGACAT GGATGCAAGC GACGTACTGT TTGATACTGC CTCGCGCATC
AATGACGTCG CTCGTAAACG AATTACGAAA GGAATTGGTG CACCTGGTGA CATTTTGTTG
TCTCACAAGG GAACTGTCGG GAAGGTTGCA CTGGTTCCAG ATGACGCCCC CCCGTTTGTT
TGTAGTCCTC AAACGACTTT TTGGCGGACA CTGAAGGGCG ATCGACTCGA TCGACGCTAT
CTCCACGCAT ATTTGCGTTC ACCTTATTTC CATCAACAGC TTGCCAGTAG GGCAGGCGAG
ACGGACATGG CTCCCTACGT TAGCCTTACG TCTCAGCGTG GCCTTCATGT GCTGATGCCA
GACATTGATA TCCAAAGACG AATAGGGAGC ATCGTTGGTG CGCTTGATGC AAAGATTAGC
GTGGAGCGGA AAATAAAGGG TACGCTGGCA GACATTGCGC GGGCTCTATT TCAATCGTGG
TTCGTTGACT TCGATCCTGT GCGTGCCAAG AGTTTAGGGA GTAGCTCCAG CTTACCTGCG
TCGTTGGAAT CGTTGTTTCC CGATACGTTC GAAGAGTCTG AACTCGGTCA GATTCCGAGT
GGTTGGACCG TTGGGTCTCT GGATCAAATC GCACATTTCC TGAATGGGCT TGCTCTGCAA
AGATTTCCCC CAAACGAGAA CGGCTCACTC CCGGTGATAA AGATCGCGCA GTTGAAGGCT
GGAAACACCG AAGGCGCTGA TCTCGCGAGC CCTAATTTGG ATCCCGGGTA CATCGTTCAG
GATGGCGACG TTTTGTTTTC TTGGTCTGGG TCGCTCGAAT GCGTAGTCTG GTCGGGCGGG
AAAGGCGCAT TGAACCAACA TTTATTTAAG GTCACATCCA AAGATTATCC GAAGTGGTTT
TTCTATCTTT GGATACACAG GCATCTAGAT GAGTTTCGAC GAATCGCCGC AGCTAAAGCG
ACGACGATGG GCCACATACA GCGCTATCAT CTCTCTGAGG CAAAGATACT TCTGCCTCAC
AAGAAATTGC TAGACGCCGC AGACCGTATA ATCGGGCCGC TCATTGAGTC TATCAACGTC
CGCGCTGTCC AATCGAAAAT ACTAGGACGC ATTCGGGATT TGTTGCTGCC GAAGTTGATT
TCGGGAGAAC TGGCGATTGA GGATGACGCA GAGTTTGGAG TCGTCAAATG A
 
Protein sequence
MNWPKSTVME LQRDGVLLVE DGNHGESRPR PDEFVKRGVA FIRAADMDAS DVLFDTASRI 
NDVARKRITK GIGAPGDILL SHKGTVGKVA LVPDDAPPFV CSPQTTFWRT LKGDRLDRRY
LHAYLRSPYF HQQLASRAGE TDMAPYVSLT SQRGLHVLMP DIDIQRRIGS IVGALDAKIS
VERKIKGTLA DIARALFQSW FVDFDPVRAK SLGSSSSLPA SLESLFPDTF EESELGQIPS
GWTVGSLDQI AHFLNGLALQ RFPPNENGSL PVIKIAQLKA GNTEGADLAS PNLDPGYIVQ
DGDVLFSWSG SLECVVWSGG KGALNQHLFK VTSKDYPKWF FYLWIHRHLD EFRRIAAAKA
TTMGHIQRYH LSEAKILLPH KKLLDAADRI IGPLIESINV RAVQSKILGR IRDLLLPKLI
SGELAIEDDA EFGVVK
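
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own method, of how such blocks might be cleaned, translated, and summarised for GC content. It uses the standard amino-acid assignments, which translation table 11 shares for internal codons. The helper names (clean, translate, gc_fraction) are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked_sequence):
    """Remove the spaces and line breaks used to print 10-character blocks."""
    return "".join(blocked_sequence.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence shown above.
first_row = "ATGAATTGGC CTAAGTCCAC CGTTATGGAA CTTCAGCGGG ACGGGGTCCT GCTCGTCGAA"
dna = clean(first_row)
print(translate(dna))  # MNWPKSTVMELQRDGVLLVE
```

The translated fragment matches the first twenty residues of the protein sequence above. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.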