Gene Acid345_4438 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4438 
Symbol 
ID4070920 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5269437 
End bp5271860 
Gene Length2424 bp 
Protein Length807 aa 
Translation table11 
GC content63% 
IMG OID637986476 
Producthypothetical protein 
Protein accessionYP_593512 
Protein GI94971464 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGA AGCTGTCGAT CATCCTTTTG TTCGTACTCG TCTGTTCGCT CGTCTCCGCA 
CAAACTACGC GCAACGTCAG CGGTCTCACC CGCCACGACG GCTTCATTCC CTTCGAGTGG
GACGAATCCA AGGGCGAACT CCTCTTCGAA CTCACGCCCA CCGCGATGCA GCGCGAGTTT
CTCTACTTCA CTTCTCTCGC CAGCGGCGTC GGTTCCACCG AACTCTTCGC TGACCGCAGC
ACCCTCGGCG AGCGCGCTCA GCTTTGCCGT TTCCGCCGCG TCGGGCCCAA AGTGTTACTC
ATCTCCGAGA ACACCGGGTT TCGCGCGTCC AACGGCAGTG CGGAACTTCA GAAGTCCGTC
GAAGCCAGCT TCCCAACCTC TGTTCTCGCT GCGATGCCGA TTGTCTCCGA GACCAACGGC
ACGCTCTACG TGAACGCCAC CTCTCTCATC GTGCGCGATG CCTTCGATCT CGTCGGACAA
TTCGAGCGTC CGCTGCGCGC CACCAACGGC AACATCCGGC CCGTTCCCGC CGGACCCGAC
GCCCCCAAGT GGAAGCTCGA TGCCGAGCGC AGTGTCGTGG ACATGGACCA CACCCGCGCT
TTTCCGCTTA ACACCGAAGC CGAAGCGCTC CTCACCTTCA CTACCGACAA GCCCGGCGCG
CGTTTCAATC AGCCCGATGC CCGCACCCTC AGCGTGCGCC AGCACCATTC CTTCATGCAG
CTGCCCGAGT CGGGCTACGA ACCGCGCGAG AGCGATCCCC GTGTCGGCTA CTTCGGCAAT
GGCTTCCAGG ACTTCTCCCG CCCCTACAAT CAGCCCATCG AACGCCTGTT GATCAACCGC
TGGCGCTTGG TGAGGAAGAA TCCCGGCGCC GCACTCAGCG AGCCGGTGAA GCCCATCACC
TTCTATCTCG ACCGCGCCAT GCCTGAGCCC ATCCGCAGCG CCGCGCGACA GGGCGCATTG
TGGTGGAACC AGGCCTTCGA ACAGGCGGGC TTCAAGAACG CGCTCGTCGT CGAAGACCTG
CCCGAAGGCG CGGACCCGCT CGACATGCGC TATCCGACCA TTCAGTGGAC GAACCGCTCC
GGCCGCGGCT GGTCGGTCGG CATGGTGCAA ACCGACCCGC GTACTGGCGA GATCCTGCAC
GCTATCGTGC AGCTCGACTC GCACCGGATG CGCACCGTCC ACAACTACTG GAACGTGCTG
CAGCCTCCCT CCGCGAACGC CGAGCCTGAT CCCGGAATCT TCTCCGAACT CGATCGCGCC
GACCCGCGTC TTACGGAAGA CGAAGCTATG ACGCGCCGTC TCGCGCTGCT CACGTGCCAC
GAGATGGGAC ATGTCCTCGG TCTCGACCAC AACTTCGTCG CCAGTACCTT CGGCCGCGGC
AGCGTGATGG ATTACTTCGC GCCGCGCGTG AAGATCCGCG CCGACGGCAC CGCCGACATG
AGCGACGCTT ACATGCAGGG CGTCGGCAGT TACGACAAGT TCGCCATCGA GTGGGGCTAC
AGCACGCTCG GCAACGAACC GCCCGCCGCC GAAGCGCAGC GCCTCGAAGC CGTCGTGGAG
CGATGGAACA AGCAGGGCGT CTTCTGGGGC AACTTTGAAG ACCCGCGCTG GAATGCTTAC
GACGACGGCA CCGATCCTGT TACCTGGCTC AAGCAAGTCA TGCCCGTACG CGATGCGCTG
GTGAAGCTCT ACACGCCGGC GCTCATGCGC AAAGGCGAGC CGTGGTCAGA CTTCGCCTCG
CGCTATGCGC TCATCTACCT TTTCCACCGC TACGGACTTG GAGCAGCGGT GAATGTCGTC
GGCAGCGCGA AAGTTCCACC TGCGCTCGTT GGTGACGGCA ACAAGCCCTT CGAAGTCTGG
CCCGCTGATC AGCAGCGCGA AGCACTCAAC CTGCTAACCT CCGCGCTCGA TCCGAGAGAG
CTACGCATCG CTCCCGAAGT CTGGTCCGCG CTAGTGCCGC TGGAAAACCG TGACTACGCC
GACAACGAGC GCTTCAAATC GCCGTCAGGT TATGTCTTCA GCCCTCAAGA CGGCGCGCGC
GCCGTGGCTG ATGTCGTCGT CGGTGGTTTA CTTATGCCGC GCCGGATCGA GCGCCTGATT
GCAATCCACA CCGAAGACGC GAATGCCGTG GGCGCCGACG AAGTCATCGA CGCACTGGTG
AAGCGTGCCG CAGCCGATGC CAACGATCCG CTCGGCGAAG TCGTGCAGTC TTCTGTCGCC
GAGCAGCTAA TGGCGCTGGC AGCCGATGAA ACGGCTACGC CGGAAACCCA AGCCGCAGCC
TATCGCGGCA TACTCGCGTC GCAGCAGGCA ATCGGCACTT CCAACCCGCG CCTCGCAAAT
GAAATCGAGC GCTTCCTCCG CGATCCGAAG AACAACACGC CGAAGCCGAA GCCGAGCGGA
GCGCCGGAAG GCCCACCCGT TTAG
 
Protein sequence
MKLKLSIILL FVLVCSLVSA QTTRNVSGLT RHDGFIPFEW DESKGELLFE LTPTAMQREF 
LYFTSLASGV GSTELFADRS TLGERAQLCR FRRVGPKVLL ISENTGFRAS NGSAELQKSV
EASFPTSVLA AMPIVSETNG TLYVNATSLI VRDAFDLVGQ FERPLRATNG NIRPVPAGPD
APKWKLDAER SVVDMDHTRA FPLNTEAEAL LTFTTDKPGA RFNQPDARTL SVRQHHSFMQ
LPESGYEPRE SDPRVGYFGN GFQDFSRPYN QPIERLLINR WRLVRKNPGA ALSEPVKPIT
FYLDRAMPEP IRSAARQGAL WWNQAFEQAG FKNALVVEDL PEGADPLDMR YPTIQWTNRS
GRGWSVGMVQ TDPRTGEILH AIVQLDSHRM RTVHNYWNVL QPPSANAEPD PGIFSELDRA
DPRLTEDEAM TRRLALLTCH EMGHVLGLDH NFVASTFGRG SVMDYFAPRV KIRADGTADM
SDAYMQGVGS YDKFAIEWGY STLGNEPPAA EAQRLEAVVE RWNKQGVFWG NFEDPRWNAY
DDGTDPVTWL KQVMPVRDAL VKLYTPALMR KGEPWSDFAS RYALIYLFHR YGLGAAVNVV
GSAKVPPALV GDGNKPFEVW PADQQREALN LLTSALDPRE LRIAPEVWSA LVPLENRDYA
DNERFKSPSG YVFSPQDGAR AVADVVVGGL LMPRRIERLI AIHTEDANAV GADEVIDALV
KRAAADANDP LGEVVQSSVA EQLMALAADE TATPETQAAA YRGILASQQA IGTSNPRLAN
EIERFLRDPK NNTPKPKPSG APEGPPV