Gene Acid345_2892 details


Gene Information

Locus tag: Acid345_2892
Symbol:
ID: 4071193
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3432847
End bp: 3434535
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 65%
IMG OID: 637984910
Product: hypothetical protein
Protein accession: YP_591967
Protein GI: 94969919
COG category:
COG ID:
TIGRFAM ID:
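
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 3432847
END_BP = 3434535


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded, assuming one terminal stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1689
print(protein_length(length_bp))  # 562
```

Both results match the Gene Length and Protein Length fields.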


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGCAG GCATCATGAA GTTCTTCCTC TCCGTCGCGC TCTGTTTGTT CTCGGTGTGC 
TCGTTTGCGC AGAACGCCGC CGTCCAGCAA GCTGCGAAGG CCGCCTCTCT TTCCGGCACG
CTGACCAAGT ACGGCAGCGG CGAGCCCCTT CGCAACGCCG AAATCTCCGT CAACCGCGTC
TCCGACACCT CGCCCGCCTC CGACGAAAGC ACCTCCAACC AGGTCAGCGT CGTTACCGAC
GCCGCCGGTC GCTTCCGCTT CCCCAGCCTC GTCCCGGGTG ACTACACCAT CGTGGTCCGC
AAGAACGGCT TCCACGGCTT CCGCGGACCG AACAGCCACA CCTGGCAGGA ATTTCTCAAC
ATCACGCTCT CGCCCGGACA AGCGGTCAAC GACCTCGCCC TCGCGATGCA GCCCGGCTCC
GTCATCAGCG GCCGCGTCCT CGACGAAGAC GGCGAGCCCA TCGCCTACGT GCAGGTCAGC
GCACTGAAGT GGGTCTACGC CAATCATCGC CGCCAGTTGC GTCCCGTCGG CGTCGGCAAC
ACCGACGACC AGGGCTCCTA CCGCATCTTC TCGCTCGAAC CCGGCCGTTA CATCGTGCGC
GCCAACGTAA TTCAGGACGG CGGCAATTCC AAGCTGCACT ACGCGCCTTC ATACTTTCCG
GAATCGTCAT CGCCCACCGA AGCCAGTCCC ATCGCGCTCC GCCCCGGCGA TCAGGCGCAA
GCCGACTTCC GCATGAGCCG CGTGCCCTCC GTCAAAGTCA CCGGCCACAT CAACGGCACC
ACCTCGGGGG CGCAGACGCA GGTCTATCTC CGCAACGCAC ACGATGAAGG CGCGTCCATC
CAGCGCTCCG CCGGCGCCAC CATGGATCAC AACGGCAACT TCACGCTCGA AGGCGTACTC
CCCGGCGACT ACACCATCTC CGCCCTCGAG TTCCGCGGCG ATAACAACGA CAACCCTCTG
CACGCCGAGG CCCCCATCCA CGTCGACGGC ACCGACCTTC CCAACATCTC GCTCACCCTC
GACGAAGGCG GCCGCTCCGC GCTCCAGGGC ACCCTCACCG TCGACACTCC GAACGTCCAG
CACCCGCGCC TCGATACCCT GCGCATCGGC CTGCTTCCCG TCGACGACAC CAACAACGAG
TTCGCCGGAA ACGGCGGTTA CTCCGCCATC GGCCCCAACG GCGCCATCCG CCTCGATCGC
ATCGTTCCCG GCAAATACGT CGTCTCTCTC ACCGCCGAAG GCTCAGGCTG GGAAGACTTC
TACACCAAGT CCGTCACCAT CGCCGGTCGC GACGTCACCG ACGCCGTCGT CAATATCAGT
TCCAGCCGCG GCGTCGTCCC CATCACCGTC ACCGCCGGCA CTGACGGCGC CTTCGTCGAG
GGCTCCGTCA CCGACGACGA AGGTAAGCCC GTCGCCAACG CTATCGTCAT CGGCGTCCCC
GACCCCGCTC TCCGCGCCCA GTTCGACCTC TACCAGCGTG GCGAAACCGA CCAGAACGGC
CACTTCCGCC TCCGCGGCAT CAAGCCCGGC GCCTACTCCT TCTTCGCCTG GAGCTCCATG
GACGACGAGT CCTACATGGA TCCCGACTTC CTCCGCGCCT TCGAAAACTC CCGGGTCGAC
CAGAGTCTCG CCCCCAACTC CCACCAAAAG GTCGAATTAA AGCTCCTCTC CCCCGACGAC
GCCCAATAA
 
Protein sequence
MSAGIMKFFL SVALCLFSVC SFAQNAAVQQ AAKAASLSGT LTKYGSGEPL RNAEISVNRV 
SDTSPASDES TSNQVSVVTD AAGRFRFPSL VPGDYTIVVR KNGFHGFRGP NSHTWQEFLN
ITLSPGQAVN DLALAMQPGS VISGRVLDED GEPIAYVQVS ALKWVYANHR RQLRPVGVGN
TDDQGSYRIF SLEPGRYIVR ANVIQDGGNS KLHYAPSYFP ESSSPTEASP IALRPGDQAQ
ADFRMSRVPS VKVTGHINGT TSGAQTQVYL RNAHDEGASI QRSAGATMDH NGNFTLEGVL
PGDYTISALE FRGDNNDNPL HAEAPIHVDG TDLPNISLTL DEGGRSALQG TLTVDTPNVQ
HPRLDTLRIG LLPVDDTNNE FAGNGGYSAI GPNGAIRLDR IVPGKYVVSL TAEGSGWEDF
YTKSVTIAGR DVTDAVVNIS SSRGVVPITV TAGTDGAFVE GSVTDDEGKP VANAIVIGVP
DPALRAQFDL YQRGETDQNG HFRLRGIKPG AYSFFAWSSM DDESYMDPDF LRAFENSRVD
QSLAPNSHQK VELKLLSPDD AQ
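
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be translated and its GC content computed. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the codon table assumes that translation table 11 uses the same amino acid assignments as the standard genetic code.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (assumed identical for tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGTCTGCAG GCATCATGAA GTTCTTCCTC"))  # MSAGIMKFFL
```

The ten residues printed agree with the start of the protein sequence; passing the full gene sequence to `gc_content` would give the value to compare against the GC content field.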