Gene Acid345_3761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3761 
Symbol 
ID4069336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4444125 
End bp4445795 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content52% 
IMG OID637985783 
Producthypothetical protein 
Protein accessionYP_592835 
Protein GI94970787 
COG category[L] Replication, recombination and repair 
COG ID[COG1637] Predicted nuclease of the RecB family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCCCT TCTCCGAAGC CGAGCTGCGC GACTTTCTCG CTGACAACAT AACGAGGATT 
GAGCCGGGAC TGACGCTTCT CGATAAAGAA AAGTACATCC CGAACGCAAT CGGCACCCGG
GGGTTCATAG ATTTGCTCGC TCAGGACGAA CGCGGCCATT TCGTACTCAT TGAGTTGAAG
CGCAGTGATG CTGCGTCTCG CGAAGCCATT CACGAAGTAC ACAAGTACGT TGAAGGTGTT
AAACGTCACT TGGGAGCGAG AGACGACGAG ATCCGTGTAA TTGTCGCCTC TACGGAATGG
CGCGAGCTAC TTCTTCCCTT TTCGCAATTT CTCGAATCAA CCCGAATCAC GGCGGAAGGC
ATTCGGTTGA TTGTAGGCGA GAGTGGTCTC CCTTCTCTCA CGGCTGAAAA GGTGGAGCCA
GTTCGCGTCA GTCGCGGTCG ATTCTTGGCT CCTTGGCACG AACTGAACCA CTACACATCT
GAGGACAGCC TCGCAAAGGG CATTCTCGAC TACGAGCATT CATGTCGCCT AAAGGGCGTT
GACGATTATG TTCTGGTCGT CTTTGAGGCA AGTCCCGACT GGTATCCGCT GGCTCAGGAA
GAGTTTCGGG CCTCCATGAT TCAAATGCAG GAGCAATTTG GAGTGCATGA CCCTGCCGAG
ATAGATGAGA TGGTGGCCAA ACTGCCGAAC TTTCGTTTCG CTTTGTACTT CGCATCCCAG
GTGCTCGGAC GAGAGTACTG CCTCGAAGTA CTGCGGCAGA ACTCCGAGGA CATGGAAGAG
CATGAAGAGA TCATCGATGG CATGGAAGAA GAAGAAGCGC TGCAATATCT GAACGACGCG
GTCCACAACT TGGCACCCAA AAAGCACAGA GACGGCTTTG AAATTGGCTA TCCCGGTAAG
TTTCAAACTC GATTCCGTGT TGGGAACCTC TGGATTCTGA AGCGGATTCA GCGTTACGGA
ATGTTTCAGA GGAACACATT GCTCTCCGAG GAAGAGATTC TGGAGGAGCT AGCCGGAAGT
GAAGGAGTTA CGGGGCAGCG CTTCAAGCGG CAAATTACGA TCAACAACAA GAGCCACATC
GCATCCGCTA AAGATGGTCT CCGAGAGTGC CTGGAACACA ACCCCGTTTG GCTTGCCCAC
ACCCTGAAAA TCATCGACGA AATTGAGAAG GATTATCCTG AGCTGGAGGC CAGCATCGAC
GTGTTCAACC CTTCGACTGG GTTGATGACG ATATATTTCG CAGCAACCCG GCCCGAGCCG
TTTGCATTCA TTCCACTTTT CACCATCGCA GTGCGGGACG GCGCGGCGAC CATCAGAGCT
TACCTTGGCT GTTTAGAAGG GGTCAGTTCA CCGATGAGCT TTCAATCGTT GATTGACAAA
TACTATGATG GCCAGATTGG AGTGTTGCTG CTGTCGGTAA CGTGGGGAGG CTATGAGCAG
AGGGACACAG ACATTCTCGA AGATTGCGGC CTCTTCTATC GTTCGTTCCG CGTAGATGAC
ATCGGATCGA CGAACGCCTT TAGCAGTTTG AGAGACGAAC GGTGGCGGCC GGTACCCCAG
TTTTTTCCAC CTGAGAAGTT CTCCGAACAC CTCGATCATC ATGGTGAATT TGTCATGGAA
CTTCTTCGCG AGATTGGTTC TCGTGACAAG GGCAGCTACT TCGAGAGCTA G
 
Protein sequence
MPPFSEAELR DFLADNITRI EPGLTLLDKE KYIPNAIGTR GFIDLLAQDE RGHFVLIELK 
RSDAASREAI HEVHKYVEGV KRHLGARDDE IRVIVASTEW RELLLPFSQF LESTRITAEG
IRLIVGESGL PSLTAEKVEP VRVSRGRFLA PWHELNHYTS EDSLAKGILD YEHSCRLKGV
DDYVLVVFEA SPDWYPLAQE EFRASMIQMQ EQFGVHDPAE IDEMVAKLPN FRFALYFASQ
VLGREYCLEV LRQNSEDMEE HEEIIDGMEE EEALQYLNDA VHNLAPKKHR DGFEIGYPGK
FQTRFRVGNL WILKRIQRYG MFQRNTLLSE EEILEELAGS EGVTGQRFKR QITINNKSHI
ASAKDGLREC LEHNPVWLAH TLKIIDEIEK DYPELEASID VFNPSTGLMT IYFAATRPEP
FAFIPLFTIA VRDGAATIRA YLGCLEGVSS PMSFQSLIDK YYDGQIGVLL LSVTWGGYEQ
RDTDILEDCG LFYRSFRVDD IGSTNAFSSL RDERWRPVPQ FFPPEKFSEH LDHHGEFVME
LLREIGSRDK GSYFES