Gene Acid345_2016 details

Gene Information

Locus tag: Acid345_2016
Symbol:
ID: 4070345
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2414440
End bp: 2415954
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 57%
IMG OID: 637984030
Product: rhomboid-like protein
Protein accession: YP_591091
Protein GI: 94969043
COG category: [R] General function prediction only
COG ID: [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid)
TIGRFAM ID:
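
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the function name `check_cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes a single terminal stop codon.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinates against the stated gene and protein lengths.

    Assumes 1-based inclusive coordinates and a single terminal stop codon
    that is not counted in the protein length.
    """
    span_bp = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(span_bp, 3)   # a complete CDS is a multiple of 3
    return {
        "span_bp": span_bp,
        "span_matches_gene_length": span_bp == gene_length_bp,
        "in_frame": remainder == 0,
        "expected_protein_aa": codons - 1,   # drop the stop codon
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record: Start bp, End bp, Gene Length, Protein Length.
report = check_cds_lengths(2414440, 2415954, 1515, 504)
print(report)
```

Under these assumptions the span is 2415954 - 2414440 + 1 = 1515 bp, which is 505 codons, or 504 amino acids once the stop codon is dropped, in agreement with the listed values.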


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGTTGC CCATTGGGCG TGAGAAAAAG ATAGTCAAAC GATTGCCGAT CGTGACTGTC 
ATCCTGATTC TCACTAACAT TCTCGCTTTC TGCCTGACGA TCCGCGACAT CGATGACGAT
ACAGGCAACC GGAACCTCAA CACGGTCCGC AACCACTTGC TGGTCATGAA GGCGCGGTTC
CCGGACGTCG TGTTGGATAC CGAAGCGCAG CAGATGGTGG ACGACTTCCG GAAGACTCGA
CCCGAAGCCT GGCAGATGGT TGCGGACCCG AACCGCGAGC CGATGGACAC ATGGGAAGCC
GTACTAGTCG ATGAAGAGAA CCCGAAAATC GAGAAGCTCC AGGAGCAGGT AAACCTGCTC
TGCATCGAGT TTCGCGACCT TCAAAACCGA GACAACTCCG TCCTCTGGAT GTACGCATTC
CACTCCTATC ATCCTAAGTA CCGGAGTTAC ATCAGCCATC AGTTCCTGCA CGGAGGGTTC
TTTCACCTCC TCGGCAACAT GTGGATGTTG TGGCTGTGCG GCGTTGTCCT GGAAGAAGTC
TGGGGACCCT ACGTGGTGCT GGGCTTCTAC CTCTGTGCCG GAGTATTCGC AGCCGCGGCG
CATGGCGCCA TGAACCCGAA TTCGCTCATA CCGATGCTTG GAGCCTCGGG ATCGGTGGCC
GGCCTCATGG GCGGCATGCT GGTGCGCTAC CCGAAGCTCA AAGTAAAGAT GTTGTTCTGG
CTGTTCTTCT ACTGGCGAAC ATTTTTCGCG CCGGTTTACA TCCTTGCGCC GCTGTGGTTT
GTTGCCGAAC TGTTCTGGGG CGGGCTCGGT GAGCGCGGCA TTGCCCACTG GGCACACGTT
GGGGGCTTCG CATTTGGAGC GGTCGTGGCG CTGGCCTTCG ATTTCGGCCG CGTCGAGAAG
ATTACGAACC CGGAAGAACC TGTGCCGGTA GTCTGGAAGC CCGACACCGA GTTCCTGCAT
GCGGCGCAGT TGCTGGAAAA ACGCGAGACG AATACTGCGC TCGCCATTCT CCGAAACTAC
GTGAAGAAGA ATTCGAATGT GATCGACGCA TGGGAATTGT TGCAGCAGGC GCAGATTCAG
AAGAACGATG CAAACGAGCA GCGTCAAGAA ACACTTCCGG TCCTGATTCG TCTTTATCTC
GGGGTTGGAA ACGACGAACG GGCGCTTTTG CACCTGCGCG AGTTCCGCAG GCTCGGAGGA
ACGATCCTTC CAGCTTCAAC GTGGCTTGAA CTCGCACGAA GCTACGAACG GGTGGAACAG
TGGGAAATCG CCGCCCGCGA ATTCGAGAAC CTTGGCATTT CGTATTACGC CACGGACCGA
ACATCGCTGA CGGCGCTGCT GAGCGCAGCC AGAATCTATC TCACGAAGCT CGATCGCCCG
GCCGATGCAA ACCGCCTTTA TCAGGCGGCC GGCAACTCGC CGATACCACA CTTGGAAATG
GATGCGGTGA TCAAGCATGG GATCAGCCAG TCCGCCGCTG CGAATACCGC AAAGAGCAAT
GGTGTAGCGT TCTAG
 
Protein sequence
MLLPIGREKK IVKRLPIVTV ILILTNILAF CLTIRDIDDD TGNRNLNTVR NHLLVMKARF 
PDVVLDTEAQ QMVDDFRKTR PEAWQMVADP NREPMDTWEA VLVDEENPKI EKLQEQVNLL
CIEFRDLQNR DNSVLWMYAF HSYHPKYRSY ISHQFLHGGF FHLLGNMWML WLCGVVLEEV
WGPYVVLGFY LCAGVFAAAA HGAMNPNSLI PMLGASGSVA GLMGGMLVRY PKLKVKMLFW
LFFYWRTFFA PVYILAPLWF VAELFWGGLG ERGIAHWAHV GGFAFGAVVA LAFDFGRVEK
ITNPEEPVPV VWKPDTEFLH AAQLLEKRET NTALAILRNY VKKNSNVIDA WELLQQAQIQ
KNDANEQRQE TLPVLIRLYL GVGNDERALL HLREFRRLGG TILPASTWLE LARSYERVEQ
WEIAAREFEN LGISYYATDR TSLTALLSAA RIYLTKLDRP ADANRLYQAA GNSPIPHLEM
DAVIKHGISQ SAAANTAKSN GVAF
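
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is an illustration under stated assumptions rather than the database's own procedure: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, the gene sequence is assumed to be listed 5' to 3' in coding orientation (the Strand field is empty in the record), and the codon table used is the standard amino-acid assignment, which translation table 11 shares for internal codons.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order; translation table 11
# uses the same amino-acid assignments (it differs in permitted start codons).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(dna):
    """Remove the spacing used in the record's 10-base blocks and uppercase."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Assumes the first codon is ATG; alternative table-11 start codons
    (e.g. GTG, TTG) would need to be mapped to M separately.
    """
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence as printed in the record.
first_row = ("ATGTTGTTGC CCATTGGGCG TGAGAAAAAG "
             "ATAGTCAAAC GATTGCCGAT CGTGACTGTC")
print(translate(first_row))  # MLLPIGREKKIVKRLPIVTV
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.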