Gene Acid345_2632 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2632 
Symbol 
ID4072041 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3103714 
End bp3104922 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content61% 
IMG OID637984649 
Productmajor facilitator transporter 
Protein accessionYP_591707 
Protein GI94969659 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.127081 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.954997 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCTCGC AGGCCACTCC CGCGCAGCGC CGCACCTTGC TCGCCGCTGC GCTTGGCTGG 
GCACTGGATG CTTTCGACGT CATGCTCTAC GCGATGGTCG TGGCATACGT CATGCGCGAC
CTCCGCATTG ATAAGCCGAC TGTCGGCCTG CTGAACACAC TCACACTTCT TGCCAGCGGT
ATCGGCGGCC TGCTCTTCGG CTGGATCGCA GATCGCGTCG GCCGCACCCG TGCGCTGATG
CTCAGCATCG CGACTTACTC GATCTGCTCC TTCGCCTCGG GACTTTCAAC TTCGGTGCAG
ATGCTGGCTG CCTGCCGTTT CGTGCTCGGA CTCGGCATGG GTGGCGAGTG GAACACCGGC
GCTACGCTCG TAGCGGAAAC TTGGCCGACC CATCTTCGTG CGAAAGCCAT CGCCGTAGTG
CAAAGCTCGT GGGCATGGGG CTACGCTGCT GCGGCGCTCG TTGCCGGACT CACGCTGCAA
TATTCTCAGA ACTGGCGTTA TGTGTTCTTC GTGGGCATCG CGCCGGCTTT GCTTCTGCTG
TGGATTCAAA AGGAAGTGCC GGAGTCCGAA CTCTGGCAGA AACAGCAAAC GAACAAAGCA
CCGAACGCGA AGCTTTCATC GGAACACGTG CGCAACGCGA TCGTCTTGCT CGCTCTCAAT
TTCTTCGGCC TCTTCGCCTG GTGGGGGCTG TTTACCTGGA TTCCGCCGTA CCTCTCGCTT
CCCGTCGAGC AAGGCGGACG CGGCTTCTCG CAACTTGGCA CCACCGGACT GCTCGTCTTT
CTAAATCTGG TTGGGATGTT CCCCGGCTAC ATCAGCTTCG GGTTCTTCGC CGATCGCATA
GGACGCCGCT GGTCGTTCTT TCTTTATCTT TTCGCTGCCG CCTGCGTAGT TCCGATCTAT
GCAGCTGCGC GACAGCCGTG GCAGATTCTC GTCATGGGAG CGGTTGTCGC GTTCTTCGGA
ACCGGCTTCT TTTCCGGCTC CGGCATTGTT GGCTCAGAGC TATTTCCGAC GCACGTCCGC
GCTCGCGCGC TCGGACTGAC CTATAACGGC GCACGCATGC TGAGCTGCGT TGCACCGTAC
GTCATCGGCT CGCTGAGCAT GCACCGCGGA CTGAGCGGAG CCTTCGTGGT TTGTGCGGTG
GGATTCCTGC TGGCGGCTTT CACCGCGCTC CTGTTACCAG AAACGCGGGG CCGCGAACTT
GCGAATTGA
 
Protein sequence
MFSQATPAQR RTLLAAALGW ALDAFDVMLY AMVVAYVMRD LRIDKPTVGL LNTLTLLASG 
IGGLLFGWIA DRVGRTRALM LSIATYSICS FASGLSTSVQ MLAACRFVLG LGMGGEWNTG
ATLVAETWPT HLRAKAIAVV QSSWAWGYAA AALVAGLTLQ YSQNWRYVFF VGIAPALLLL
WIQKEVPESE LWQKQQTNKA PNAKLSSEHV RNAIVLLALN FFGLFAWWGL FTWIPPYLSL
PVEQGGRGFS QLGTTGLLVF LNLVGMFPGY ISFGFFADRI GRRWSFFLYL FAAACVVPIY
AAARQPWQIL VMGAVVAFFG TGFFSGSGIV GSELFPTHVR ARALGLTYNG ARMLSCVAPY
VIGSLSMHRG LSGAFVVCAV GFLLAAFTAL LLPETRGREL AN