Gene Acid345_2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2120 
Symbol 
ID4069546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2534841 
End bp2536442 
Gene Length1602 bp 
Protein Length533 aa 
Translation table11 
GC content57% 
IMG OID637984135 
Producthypothetical protein 
Protein accessionYP_591195 
Protein GI94969147 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.981673 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCAACAAT TCAGGAAGAT TCTCGCTCTT GTAGTGCTTG CGAGCGCGGC GACCTTTGTT 
CTCGCGCAGG ATGAAAGCCT GGGCGACGCA GCGCGCAAAG TTCGCGACAA CAAGAAAGAT
GATGTCCAGG TGAACAAGGA CGACGCCAAG GAACTCTTCC AGTCGATGAA CAAGATCATG
ACGTTTGCCA GCGCGGACAG CGGCTTCGGG CGGCGGACCG CGGTGAACCA CAAGATGCTG
GGGCGTCCAG ACGTGGAAAA GCATTTCAAC GACAGCCTGC AGGAGGAAGT GAAGAAGCAA
AGGCTGGCGG AGTCGCAGGT TGTGCTGCAA AAGTTTGGTC TTTTGCCGGG CGATTTCAAC
CTGGAAACTT TCCTTGAGAA GAACACAGCG AAAGCGCTCG GTGGATTTTA CGATCCGCGC
GACAAGACGA TGTATCTCCT GAACTGGATC CCACTCGATC GGCAGAAGGA CATCATGGCG
CATGAGTTGA CCCATGCGCT GCAGGACCAG AATTACAACA TCATGAAGTT CGAGGGTTGG
GACCCGAAGC AGGCGGACGC GAGCACCGTG AAGATGTCTG TGGACGACGA GGATGGTGAG
CGGCAGACGA CGCGGCGTGC GGTGATAGAG GGGCAAGCCG AAGTCGTACA TTACGATTAC
ATCCTGCAGC CCTACAGCCT GAGCTTGTCA GATGGATCCG GCGCTCTGGA ATTGATCCAG
GACGCGATTC GGATGAGCTA CGACAACGCA GTTGTATTCA AAATCGCACC CAGGCTGCTC
CGCGACACAT CATTCTTTCC ATATCGTGAA GGGTTCAACT TTGAACTCGA ACTGTTGAAG
AAGGGCGGAC GAAGCATGGC CTTTTCGACC CCGTATTCGC GCCCGCCGCA CGGTACGCAC
GAAGTGCTTC AGCCTGAGAC GTACATGAGT GGAACCCATG TGGCGCCGGT GAAGATTCCA
GACCTGTCGT CATATCTCGG AAGCGCGTAC GAGGCCTACG ACTCAGGTTC GATGGGCGAG
CTCGATGTCC AAGTAATGGC GCAGGAATTC GGCATTGAGA ACGATGTCTA TACGGTGGCG
CGGAAGTGGA ACGGAGGAGC GTACCTCGCT CTGAAGAAGA CATCGCATCC CAAAGATCAG
CCGATGACAA CCTCGGACCT GGCGCTGGTG TACCTGTCGA AATGGGGCAC TGAAAAGGCC
GCCACGCGCA TGGCCCAGAT TTATCTCGAC GCGCTCGGTA AGCGGGTGCA GATCACGGAA
GCGCCGACCA TCACCACGCA TGATTGCGAA GCCGCCAAGT GTCCGACCGC GCTGTGGGAG
GCGCACCTGA AGACGGCGGA CGGGCCGGTG AACCTGGAGG TATGGCCGAA GGCGACCCTA
CTGATCACCG AATCGCTGGA CGATGACATG GTGTCGAAGC TACGGGTTCC GCTGCTCGCG
CCCCCGACGA AGAAGGGTGC GGCGGCTCAG GTTAAAACGC CTTACGGAAC GGACGAGTTG
GCAATGCGTC TTTACGATGA CTCTCGCTTC GCAGATTTGG CAGAAAGCGT GGCCGCGGAG
ATTGCAGCCC ACGCCGCCGA ACGGCTACAG AAAATACATT GA
 
Protein sequence
MQQFRKILAL VVLASAATFV LAQDESLGDA ARKVRDNKKD DVQVNKDDAK ELFQSMNKIM 
TFASADSGFG RRTAVNHKML GRPDVEKHFN DSLQEEVKKQ RLAESQVVLQ KFGLLPGDFN
LETFLEKNTA KALGGFYDPR DKTMYLLNWI PLDRQKDIMA HELTHALQDQ NYNIMKFEGW
DPKQADASTV KMSVDDEDGE RQTTRRAVIE GQAEVVHYDY ILQPYSLSLS DGSGALELIQ
DAIRMSYDNA VVFKIAPRLL RDTSFFPYRE GFNFELELLK KGGRSMAFST PYSRPPHGTH
EVLQPETYMS GTHVAPVKIP DLSSYLGSAY EAYDSGSMGE LDVQVMAQEF GIENDVYTVA
RKWNGGAYLA LKKTSHPKDQ PMTTSDLALV YLSKWGTEKA ATRMAQIYLD ALGKRVQITE
APTITTHDCE AAKCPTALWE AHLKTADGPV NLEVWPKATL LITESLDDDM VSKLRVPLLA
PPTKKGAAAQ VKTPYGTDEL AMRLYDDSRF ADLAESVAAE IAAHAAERLQ KIH