Gene Acid345_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3039 
Symbol 
ID4071946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3607205 
End bp3608647 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content60% 
IMG OID637985058 
Producthypothetical protein 
Protein accessionYP_592114 
Protein GI94970066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0289115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGCCTG CGCTCCTCGT GTGCCTGCTG CTCTGCGGCT TTGTGATGTC CGCGCAATCG 
CCTCCGCCGA TCAAGATCGG CGATGTCACC TTCTCCGGCA GCATTCGCGA GCGCGGCGAG
GCTTGGGATT TCTTCGACGC CGGAACCGGC GGCAACGCCT ACGGCTTCTC CGGCACCACC
ATCCGTTTTG GATTCTCGCA ACAGAAAACG TCATATGACT GGAACGTCGA ATTTCTCTCA
CCCATCCTTC TCGGTCTTCC CGAGCAGGCC GTTCAACCTG CGCCCTTGGG ACAACTAGGC
CTCGGCGGTT CTTACTACGC CGCGAATGAC AAGCAACAGA ACGTCGGCTT CGTAAACCTG
AAGCAAGCAT TCATTCGCTT CAAATCGAAA CAGGACTCTC TTCGCATCGG ACGCTTCGAT
TTCAACGACG GCACCGAAGT CGCTCCCGCC GATCCCACAC TCGCCGCATT AAAGAGCGAT
CGCATCTCCC AGCGTCTCAT CGGCGTCTTC GGTTTCTCTG ACGTCCTCCG CAGCTTCGAC
GGCGTCCAAC TCTCCTCGAA CCGCGGACCC TGGAACATCA CGGCTCTCGG TGTAATTCCT
ACGCGCGGCG TCTTCCAGGT AGACGCGTGG GGATGGGTCA AAACTCCGTT TGTCTATGGC
GCCGTTACTC GGCAGGCCAA TCTCGGCCGC AAGACCAAAG CCGACTGGCG CATCTTCGGC
CTTTACTACA ACGACGATCG TCCCATCGTG AAAACCGATA ATCGCTCAAC TGCACTCCGT
GCAAGCGATC TCGGCGGGAT CAACCTCGGC ACCTTCGGCG GACACTTCCT CTCCGCCACG
CCCTCCGAAT CAGGAACCTA CGACTTCCTC GCCTGGGGAG CCGCGCAATT CGGCACCTGG
GGACAACTCG ATCACCGCGC CGCAGCCGCG TCCATCGAAG GCGGATGGCA GCCTACACTC
TGGAAGCCCG CGCGTCCCTG GTTCCGCACC GGTTACGACT GGACCAGCGG CGATGCCGAC
GCGAAGGATG GCACTCACGG CACCTTCTTC TCCGTGCTAC CTACGCCACG CATCTACGCG
CGATTTCCGT TCTTCAACGC GATGAACAAC CGCGACATTT TCGGAGAAAT AGTGTTCCGC
CCTGGACCGG CCGTCACCAC GCGCACAGAC ATCCACAGCC TCTGGCTCAG CAGCAGCCAC
GATCTCTGGT ACTCCGGCGG CGGCGCATTC CAGCCGTGGA CCTTTGGCTA CGCCGGTCGT
CCCGCGAACG GCTCAACGTC ACTCGCTACG CTCTACGACA CCAGCATCGA CATCAAGGCC
ACATCCGCTT TGAATTTCGG CTTCTATTTC GGATACGCCG TCGGCGGCGA TGTGATCAAA
AAGATCTTCA CCACCAACGC CAACGGCGCA TTGGGATTCA TGGAAGTGAC CTACAAGTTC
TAG
 
Protein sequence
MKPALLVCLL LCGFVMSAQS PPPIKIGDVT FSGSIRERGE AWDFFDAGTG GNAYGFSGTT 
IRFGFSQQKT SYDWNVEFLS PILLGLPEQA VQPAPLGQLG LGGSYYAAND KQQNVGFVNL
KQAFIRFKSK QDSLRIGRFD FNDGTEVAPA DPTLAALKSD RISQRLIGVF GFSDVLRSFD
GVQLSSNRGP WNITALGVIP TRGVFQVDAW GWVKTPFVYG AVTRQANLGR KTKADWRIFG
LYYNDDRPIV KTDNRSTALR ASDLGGINLG TFGGHFLSAT PSESGTYDFL AWGAAQFGTW
GQLDHRAAAA SIEGGWQPTL WKPARPWFRT GYDWTSGDAD AKDGTHGTFF SVLPTPRIYA
RFPFFNAMNN RDIFGEIVFR PGPAVTTRTD IHSLWLSSSH DLWYSGGGAF QPWTFGYAGR
PANGSTSLAT LYDTSIDIKA TSALNFGFYF GYAVGGDVIK KIFTTNANGA LGFMEVTYKF