Gene Acid345_2122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2122 
Symbol 
ID4072364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2537002 
End bp2538276 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content59% 
IMG OID637984137 
Productmajor facilitator transporter 
Protein accessionYP_591197 
Protein GI94969149 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.74746 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGGTC CCACGTCCAT CTCCATGCGG TCGTACTGGC GACTGTTGCG CAACAACCGC 
AACTTCCGTC GTCTCTGGAT CGCGCAAGTC GTGAGCGAGA CCGGCGACTG GTTCTACATG
GTCGCGCTTT ACGCGATGCT GCTGGAATTC ACTGGGCGCG CGGAGGTCCT CGGCATCGCG
TTCGTACTGC AGGTGCTTCC GCAGGCGCTC ACCGGACCCA TCGCAGGCGT TATCAACGAT
CACTTCAGCC GCAAACGAGT GATGGTCTTC ACCGACATCG CGCGATTCCT GATCATTAGT
TGCGTTCTCT TCATTCGCTC CGCCAGCCAG GTCTGGATGA TCTATCCCCT TCTCTTTATT
GAAACCGTGA TGTGGGGCTT GTTCGAACCG GCGCGTAATT CCGTGATTCC CAATGTTGTG
AGCGAAGAAG ATGTGATCGT CGCCAACACC GTCAGCTCGA CGACCTGGTC TGTAAACCTG
TTTCTCGGCG CCGCGCTCGG TGGCATCGCG GCCGTATGGC TAGGCCGGGA CCTTACCATC
ACTCTCGATG TAATGACGTT CCTCGTCTCA GCATGGTTAA TCGCCGGGAT GAAATTCCAG
GAACCGCACC TTCAAGGTCT TGAACACATC CGACTACGCG ATGCGATCAA CTTTGCGCCC
ATGTTGGACG GCTTCCGCTA CATCGCTCGC CAGCCACGCA TGCTGACTAC CGTTCTTGTG
AAAGCCGGCA TGGGTCTGAG CGGAGCAAGT TGGGTGCTCT TCCCGATTCT CGGCAGGCAG
GTATTTCCGA TCTTCCGTGC CGGATTCACG ACTGAGAAAG CAGCCCTGGC CGGGATCAGC
GCGCTTATGG CCGCCCGCGG ACTCGGTTCT GCCCTCGGGC CGGCACTTGG CGCACCATGG
GCTCAACAGA ATTTCCGACG CCTGCGCTAC GGCATCTTCC TCGGGTTCCT TGCTTCAGCT
GCGGGTTACT GGGCCTTGGC CTTCACGCAT ACCGCGTGGA TTGCTTATCT CGAAATCATT
GGGTCGCACG CCGGCAGCGC GGTCGTCTGG GTGTTCTCCA CCACGCTTCT CCAACTGATG
AGCGAAGACA AGTTCCGGGG CCGTCTTTTC TCCGCCGAAC TTGCATGCTG CACCATCACG
CTCGCGGCCA CGTCCTTTGC CGCCGGGTAC GCTCTCGATC GCGGAGTCGC TCTGAATACA
GTTCTCTTTT GCACGGGCCT GATCATCGCC GTCCCGTGGC TGCTGTGGGG AGCAGTCGGA
TTAAAGAAAG ATTAA
 
Protein sequence
MPGPTSISMR SYWRLLRNNR NFRRLWIAQV VSETGDWFYM VALYAMLLEF TGRAEVLGIA 
FVLQVLPQAL TGPIAGVIND HFSRKRVMVF TDIARFLIIS CVLFIRSASQ VWMIYPLLFI
ETVMWGLFEP ARNSVIPNVV SEEDVIVANT VSSTTWSVNL FLGAALGGIA AVWLGRDLTI
TLDVMTFLVS AWLIAGMKFQ EPHLQGLEHI RLRDAINFAP MLDGFRYIAR QPRMLTTVLV
KAGMGLSGAS WVLFPILGRQ VFPIFRAGFT TEKAALAGIS ALMAARGLGS ALGPALGAPW
AQQNFRRLRY GIFLGFLASA AGYWALAFTH TAWIAYLEII GSHAGSAVVW VFSTTLLQLM
SEDKFRGRLF SAELACCTIT LAATSFAAGY ALDRGVALNT VLFCTGLIIA VPWLLWGAVG
LKKD