Gene Acid345_1905 details

Gene Information

Locus tag: Acid345_1905
Symbol:
ID: 4069383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2287588
End bp: 2288688
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 59%
IMG OID: 637983916
Product: lipolytic enzyme, G-D-S-L
Protein accession: YP_590980
Protein GI: 94968932
COG category:
COG ID:
TIGRFAM ID:
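The coordinate and length fields in this record are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2287588, 2288688)
print(length)                     # 1101
print(protein_length_aa(length))  # 366
```

Both results match the Gene Length and Protein Length values listed in the record.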


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0113749
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGATCTCCC GACTTCTTGC CCTCATCCTC GCGCTTACTT CCTTCACTGT CGCAGCGCCA 
CCGGTGAAGT TCCAAATCTC TCCCGTATCC TCCAGCGAGA AGACTGGTCT CCACGTTCTC
CCCATGCACA TCGGCGGACG CGTCCTCCAG CGCGGCGCCG AATCTGCTCC GTCGTACGAA
CGTCAATGGC CCGGCACTTA CTTCGATACC GCCTTCATCG GCCCGGACGT CTATTTCAAA
CTCGGCTCCG GCGACGCCAT CCTCAAGATC ACCCTTGACC AATCTTCCCA GTCACTCACC
AAGCCCGCAC CCGGCCTCTA TGTCATTCGC GGACTCACCA ACAAGCATCA CGATCTCCGC
CTTGAAGTCA TCACCGAAAG CCAGGCCGGC CCAACTTCGT TCGACGGATT CTTCGCGCCG
CGCTCTGCCA AGCCGGACAC ACCGCACTCC TATCCACTCC AGATCGAGTT CATCGGCGAC
TCGCACACCG TCGGCTACGG CAACACCTCA CCCAAACGTG AATGTACGGA AGACGAAGTC
TGGGCTACCA CTGACACTTC GCAAGGCATC GCGCCTCTGG TCGCACGTCC CTTCCATGCC
GACTACCAGG TCAACGCAAT TTCCGGCCGC GGTATCGTTC GCAACTACAA CGGCTTTCCC
GGAGATACCC TCCCCGCCGC CTATCCCTTC ACCCTTCTCG ACCATACCTC GCGCTACGAT
AATCCCGACT GGCGTCCGCA GGTCATCGTG GTCTCCCTCG GCACCAACGA TTTCAGCACC
CCACTCCACG CCGGCGAAAA ATGGAAGACC CGCGACGAAC TTCACGCCGA CTACGAGCAG
ACCTACGCCG AATTTCTCCA CCAACTCCGC GCCAGGAATC CCAAGGCCTA CTTCATCCTC
TGGGCAACCG AGATGTCTGA CGGTGAAATC CTCGCCGAAG TCCAAAAGGT CGCCGACCGC
GTTCGTTCCG CGGGAGAGAA ACAGATTTCA GTTGTGCCAG TGAAGGAATT AGAAGTGACG
GGCTGCAACT ACCATCCGTC TTTGACGGAC GACCGCAAGA TCGCTGACGC CATAGTGGCC
GCGATAAAAG CAAAAAACTA G
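The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases among all bases in the gene sequence; the record does not say how the database derives its figure. The helper name gc_percent is illustrative. Whitespace is stripped first because the sequence is printed in blocks of ten.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    dna = "".join(seq.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("TTGATCTCCC GACTTCTTGC"))  # 50.0
```

Passing the full gene sequence to the same function gives a value to compare against the reported 59%.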
 
Protein sequence
MISRLLALIL ALTSFTVAAP PVKFQISPVS SSEKTGLHVL PMHIGGRVLQ RGAESAPSYE 
RQWPGTYFDT AFIGPDVYFK LGSGDAILKI TLDQSSQSLT KPAPGLYVIR GLTNKHHDLR
LEVITESQAG PTSFDGFFAP RSAKPDTPHS YPLQIEFIGD SHTVGYGNTS PKRECTEDEV
WATTDTSQGI APLVARPFHA DYQVNAISGR GIVRNYNGFP GDTLPAAYPF TLLDHTSRYD
NPDWRPQVIV VSLGTNDFST PLHAGEKWKT RDELHADYEQ TYAEFLHQLR ARNPKAYFIL
WATEMSDGEI LAEVQKVADR VRSAGEKQIS VVPVKELEVT GCNYHPSLTD DRKIADAIVA
AIKAKN
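The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal pure-Python translation, assuming the gene sequence is given in coding orientation, as it appears to be here. Table 11 shares its codon assignments with the standard code but allows alternative start codons such as TTG, which are read as methionine at the first position. The function and constant names are illustrative.

```python
from itertools import product

# Codon table in NCBI order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGATCTCCC GACTTCTTGC CCTCATCCTC"))  # MISRLLALIL
```

The output matches the first ten residues of the protein sequence, including the TTG start codon read as M.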