Gene Acid345_0540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0540 
Symbol 
ID4069998 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp667547 
End bp669514 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content56% 
IMG OID637982545 
Producthypothetical protein 
Protein accessionYP_589619 
Protein GI94967571 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.920015 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.216808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAGT TCTGGTTTGC CCGCAACACT ACCTTGTGTG CTGTGGTTTT CCTCGGTTTA 
GGTCTGGCCC GTGCGATTGA TTTTCCGCCT GTCACTCCTG AACAACTTGC GATGAAAGAC
AATCCGGCGC AGCCCGGGGC GAAGGCGATG ATTCTGTATC GAGCCATTGA GCGCGACGAC
AGGATGGGCT CGCAGGCTGA GTATGCCCAA ATCAAGATCT TCACGCAGGA AGGCAAAGAC
TACGGCGATG TTGTACTGGA TTTCGATCGG GGTGCATACA CGGTCGAATC GATCAAGGGC
CGGACAATTC ATCCGGACGG AACCGTGATT CCGTTCAATG GCAAGGCGTA TGAAAAGGTG
ATTGCCGAAG GCCAGGGATT CAAGGTTCAC CGCAAGGCAT TCACCATGCC CGACGTGACG
CCCGGCAGCG TCCTTGAATA CAAGTACGTA GTGCGATGGG AGGCCTCGGA CCCGGCGACA
CACCAATATT ATTATTTCCC TCGCTCCGAG TGGGAGGTAT CCAAGGAGCT CTACCAGCAG
AGCGCGCATT TTGTGTTTAA GCCGCTGACG ATGGACGGAC TCTATTGGTC CCTGCGCAGC
AACCGCCTTC CGCCGGATGC GAAGTTCAAT CACGAACAAC TTACGGACAA GGTCACCCTG
GACCTGGTGA ATGTACCTGG CGTGGAGAAG GAAGAATTCA TGCCGCCGTC GTCAGAGACG
AAGGCGCGAG TGCTCTTCTT CTATAGCGAT ACCCATATTC CGGAGCCCGA TCAGTATTGG
AAAGACCATG GAAAGAAGTG GCACGGCTGG GCGGAAGGCT TCATGGACAA AAAGGGCGCG
ATCCAGAAGG ACCTCGCCAG CGTGATTTCG TCGAGCGATT CGACGGATGT GAAGCTCCGG
AAGATTTATG AGCACGTGCA GTCATTCGAG AACCTGGAAT TCGAATCGGC GAAGAGCGAC
AAGGAAATCA AGGCGCTGAA GATCCGGGAC ATTAAGAGCA TTGAAGACGT GATCAATGGC
AAGGCCGGGT ATCGCAATGA GCTGAACCGT ACATTCGTCG CGCTGGCGCG TGGAGCAGGA
TTCGATGCGA CGCTGGTGGC GGTGACGGAG CGCGATACGG CCATCTTCCA CAAAGAGTGG
CCATCTTCGT CGCAGCTCGC TTATGAGATC CCGCTGGTGA AGGTGAACGG TGCAGATATT
TACCTCGATC CGGGGAGCCC GTTTTGCCCG TTCGGCGTGG TGCCATGGGA AGACACCGCA
GTTTCAGGAC TGAAGCTCGA TAAGAACCCG CCGGTTTGGG CACAGATTCC ACTTCCACCC
AGCGATGACT CGAGCATTAA GCGCGTCGCG AAGATGACGT TAGGCGACGA CGGTTCTCTG
ACCGGCGAGG TCGAAGTGAC GTTCACGGGG CAGGATGCGT TCCATCATCG GCTCTGGGAG
CGCAACGAGG ACGATGCCGG CAAGAAGAAA GATATGGAGG AATTACTCCA GGACTGGATG
GCGCTGAAGG CGGATATTGA GTTGGAGAAG GTAAATGATT GGAAGGCGTC CAATGTTCCG
CTCGTCGCGA CCTTCAAGGT GACGGTGGCA GGCTACGCGA GCCAGGCCGG CAAACGCGTG
CTGATTCCCT GCACTTTGTT CGCCGCTGCC TACAGGAACC CGTTCACTCC GACGAAGCGG
GTGAACCCAA TCATCATGCA TTACGCCTAC GACCGCAGTG ACGACGTCAC GATCAAGCTG
CCGGCGAATT TCCAGGTGGA GAGCATGCCG AAGCCGGTCG CCGAACAGAA CAACATCGCG
GACTTGAACG TGAAGTGCGA CAGCAACAAC GGAACGTTGC ACCTGGTGCG GGACTTCAAG
CTGAAGGGCC TATTCATTGA TCAGAAGTAT TACGGAGCGG TTCGCGGATA TTTCCAGCAG
GTCCAGGCGG GGGCAAATGA GCAAGCTGTA CTCAAGATGG GAAATTAG
 
Protein sequence
MKKFWFARNT TLCAVVFLGL GLARAIDFPP VTPEQLAMKD NPAQPGAKAM ILYRAIERDD 
RMGSQAEYAQ IKIFTQEGKD YGDVVLDFDR GAYTVESIKG RTIHPDGTVI PFNGKAYEKV
IAEGQGFKVH RKAFTMPDVT PGSVLEYKYV VRWEASDPAT HQYYYFPRSE WEVSKELYQQ
SAHFVFKPLT MDGLYWSLRS NRLPPDAKFN HEQLTDKVTL DLVNVPGVEK EEFMPPSSET
KARVLFFYSD THIPEPDQYW KDHGKKWHGW AEGFMDKKGA IQKDLASVIS SSDSTDVKLR
KIYEHVQSFE NLEFESAKSD KEIKALKIRD IKSIEDVING KAGYRNELNR TFVALARGAG
FDATLVAVTE RDTAIFHKEW PSSSQLAYEI PLVKVNGADI YLDPGSPFCP FGVVPWEDTA
VSGLKLDKNP PVWAQIPLPP SDDSSIKRVA KMTLGDDGSL TGEVEVTFTG QDAFHHRLWE
RNEDDAGKKK DMEELLQDWM ALKADIELEK VNDWKASNVP LVATFKVTVA GYASQAGKRV
LIPCTLFAAA YRNPFTPTKR VNPIIMHYAY DRSDDVTIKL PANFQVESMP KPVAEQNNIA
DLNVKCDSNN GTLHLVRDFK LKGLFIDQKY YGAVRGYFQQ VQAGANEQAV LKMGN