Gene Acid345_1500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1500 
Symbol 
ID4069247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1825548 
End bp1826894 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content58% 
IMG OID637983509 
Productmajor facilitator transporter 
Protein accessionYP_590576 
Protein GI94968528 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.21091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAC CGCGCCGCAA TTTTTGGCAG ATCGTGAACA TGAGCGTTGG CTTTTTGGGG 
ATCCAGTTCG GATGGAACCT GCAGATGGCC AACATGAGCG CCATCTATGA GTATCTCGGT
GCGCGAGCGG ACCAGATTCC GATTTTGTGG CTCGCGGCGC CGCTCACCGG GCTCATCGTG
CAGCCGCTCA TCGGGCACGC AAGCGATCAC ACATGGGGCA AGCTCGGCCG CCGGCGTCCA
TATTTTCTAA CCGGCGCGAT TCTCAGTTCG CTGGCGCTGA TCCTGATGCC GCGCTCGGGC
GCGCTATGGA TGGCCGCGGG GCTGCTTTGG ATTCTCGACG CGTCGATCAA CATCAGCATG
GAACCGTTCC GCGCATTCGT CGCCGACATT CTTCCGGAAG AACAACGCAC CCGTGGCTTC
GCGATGCAGA GCCTGATGAT CGGTCTTGGC GCGGTGATGG CGAACGTTCT CCCCTACCTG
CTGCTGAAGT TCGGAAACTT GAAGGCCGAC ACGACGGGAT ATGCGATTCC ACTCGCAGTG
CGAATCTCGT TCTACGTCGG CGCGGCTGCA TTCTTCGGCG CGGTAATGTG GACGATCCTG
ACGACGAAGG AGTATCCACC GGACGATCTG GAAGCCTTCC GAAAGAAGAA AGAAAAAAAG
GGCGGCTTGG GATTGGGAGA GATCGTCAAC GCGGTTCGCG AAATGCCCAT GACGATGCGG
CAACTGGCGC CGGTACAGTT CCTTACCTGG CTCGGACTGT TTTGCATGTG GCTGTTCTTC
GGCGTTGCGG TAGCGCGCAA TGTGCTCGGC GCTACCGATG CGAAATCGAA GCTCTACACC
GATGGCATCG CGTGGGGCGG CATCTGCTTT GCGTTCTATT CCGGGGTCAC GTTTGTCTAT
TCGTTTTTCC TGCCCGCAAT TGCAAAGGCC GTTGGACGCC GGCGCGCGCA TAGCCTCTCT
CTGCTTTGCG GCGCGGCAGG ACTGATCTCA GTCGCGTTTA TTCACGACAA GAACTTTTTG
CTGCTCTCGA TGGTCGGCGT CGGCATCGCG TGGGCGAGCA CTCTCGCAAT GCCGTACTCG
ATCCTTGCCG CTTCCCTGCC GCCGGAGCGC ACCGGCGTGT ACATGGGCAT CTTCAACTTC
TTCATCGTGA CGCCAGAAAT TATTGCCTCG CTTGTTTTTG GCTGGGTGAT GGTGCACTGG
CTCAACAATA ATCGCTTATA CGCGGTGATT GCCGGAGGAG TGTTCATGAT TGCGGCAGCG
ATCATGATGC AATTTGTGAC CGATCCAGCG GAGAAGAAGG TACGCGCCAC GGAACCGGAG
GCGGTTGTGG CGAGTCGCAG GGGTTGA
 
Protein sequence
MDKPRRNFWQ IVNMSVGFLG IQFGWNLQMA NMSAIYEYLG ARADQIPILW LAAPLTGLIV 
QPLIGHASDH TWGKLGRRRP YFLTGAILSS LALILMPRSG ALWMAAGLLW ILDASINISM
EPFRAFVADI LPEEQRTRGF AMQSLMIGLG AVMANVLPYL LLKFGNLKAD TTGYAIPLAV
RISFYVGAAA FFGAVMWTIL TTKEYPPDDL EAFRKKKEKK GGLGLGEIVN AVREMPMTMR
QLAPVQFLTW LGLFCMWLFF GVAVARNVLG ATDAKSKLYT DGIAWGGICF AFYSGVTFVY
SFFLPAIAKA VGRRRAHSLS LLCGAAGLIS VAFIHDKNFL LLSMVGVGIA WASTLAMPYS
ILAASLPPER TGVYMGIFNF FIVTPEIIAS LVFGWVMVHW LNNNRLYAVI AGGVFMIAAA
IMMQFVTDPA EKKVRATEPE AVVASRRG