Gene Acid345_3588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3588 
Symbol 
ID4072810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4239476 
End bp4240699 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content59% 
IMG OID637985611 
Producthypothetical protein 
Protein accessionYP_592663 
Protein GI94970615 
COG category[S] Function unknown 
COG ID[COG3503] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0967112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.500812 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGTGC CTCAACTACA GACTTCTCCT GGACAGAGTT CCGCGATCGC AACTGGAACA 
CGCGTGGCCT CCGCGGACGT TCTGCGCGGA ATGGTGATGG TGCTGATGGC CATCGACCAC
GTGCGCGTGT ATTCCGGCGT TCCTGCCGGC GGACCGACTG CGGGCGTGTT CTTTACGCGA
TGGGTCACGC ATTTCTGTGC GCCCGGGTTT GCATTTCTTG CGGGAAGCGG CGTGTTTCTC
TACGCGCAGA AACACAGCGG CGTCTCGAAG TTCCTGGCGA CACGCGGGCT GTGGCTGGTG
TTGCTGGAAC TCACCGTGGT GCGCGTGGCG TGGACGTTCA ACTTTCATTT CCTCGACTAC
GACCTTGCGG GCGTGCTGTG GATGCTCGGC TGGTGCATGG TCATGATGGC CGCGCTCGTC
AAGCTTCCGA TCAAAGTCGT CGGCGCCGTG GGGCTCGCGA TTATTTGCCT GCACAACCTG
ATGGACCATG CGATGCCGGG GATTGTGAAA TCGCTGGAGA CGAACCCCTA CGCCGCACTC
TGGAAATTTC TCTACGTCGG ATTTTGGGCA GGGCCGGTGC AGATGGGGGC GCACGGGCCT
ACGGTGTACG TGCTCTATTC GCTGATCCCG TGGATTGGCG TGATGGCCGC GGGGTATGCG
TTTGGATCGG TCTTCGCGCT TACGCCGGAG CGTCGGCGGC AAGTGTGCAT GATGGTCGGA
CTGGTGGCGA TCGCGGCGTT TGTTTTACTG CGCGCTACGC ACTTCTATGG CGATCCGCGA
CCGTGGCAAG GGTGGCAGAC GGTGCATGGG ATGCCGCCGC TGCTGGCTTT CCTGAATACG
ACGAAGTATC CGGCTTCGCT CTGCTTTTTA CTGATGACGC TGGGGCCGGC AATCCTCGCG
ATTCCGCTCT TTGAAAGTGG CAGCTCTGCA TTTGCGAAGG TGATGATGGT ATATGGCAGA
GTGCCGTTCT TTTTCTACCT GCTGCACATT CCGCTCATTC ACGCGCTGGC GCTGATCGTT
TCGAAGATAC GTTTTGGCTA CGTGGACCCG TGGTTGTTCA CGAACCATCC GATGGGGAAC
CCTCAACCAC CCGACGGGTA TGTGTGGAGT TTGCCGTTGT TGTATTTGGT GTGGGCAATC
GCGGTATTCC TGCTCTTCTT CGCGTGCCGA TGGTATGCCG CGGTGAAGGC CCGCCATAAA
GATGGACGGT TGCGGTATTT GTGA
 
Protein sequence
MTVPQLQTSP GQSSAIATGT RVASADVLRG MVMVLMAIDH VRVYSGVPAG GPTAGVFFTR 
WVTHFCAPGF AFLAGSGVFL YAQKHSGVSK FLATRGLWLV LLELTVVRVA WTFNFHFLDY
DLAGVLWMLG WCMVMMAALV KLPIKVVGAV GLAIICLHNL MDHAMPGIVK SLETNPYAAL
WKFLYVGFWA GPVQMGAHGP TVYVLYSLIP WIGVMAAGYA FGSVFALTPE RRRQVCMMVG
LVAIAAFVLL RATHFYGDPR PWQGWQTVHG MPPLLAFLNT TKYPASLCFL LMTLGPAILA
IPLFESGSSA FAKVMMVYGR VPFFFYLLHI PLIHALALIV SKIRFGYVDP WLFTNHPMGN
PQPPDGYVWS LPLLYLVWAI AVFLLFFACR WYAAVKARHK DGRLRYL