Gene Acid345_3930 details

Gene Information

Locus tag: Acid345_3930
Symbol:
ID: 4071313
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4643219
End bp: 4644529
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 57%
IMG OID: 637985956
Product: glucose/galactose transporter
Protein accession: YP_593004
Protein GI: 94970956
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0738] Fucose permease
TIGRFAM ID: [TIGR01272] glucose/galactose transporter
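
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon that is not translated; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4643219
end_bp = 4644529
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are 1-based and inclusive, so both ends are counted.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that yields no residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1311 bp is 437 codons, and 437 minus the stop codon gives 436 aa.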


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.121013
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.215751
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAATTT CAGCGCCGCA AAGTTCCAGC GCCAATAACA GCCAGGACGC TCCCGTAAAA 
ACCAATTACA GCGCCATGGC CATGGTGACC ACGCTGTTCT TCATGTGGGG ATTTCTCACC
TGTCTGAACG ACATTCTCAT CCCCCACCTG AAGGGACTGT TCGACCTGAC TTACATGCAG
GTGATGCTGG TGCAGTTTGC CTTCTTTACC TCGTATTTCG TGTTCTCGAC GCCCTCAGGC
AAGCTGATTG AAGCGGTCGG ATACAAGCGT GCGATGGTTG CCGGACTCTG CACCATGGGA
ATTGGCACGA TTCTGTTCGT GCCCGCGGCG ATGGTGCCTT CGTATCCACT GTTCCTTACG
GCCCTCGTGA TTCTTGCCGG TGGCATGACG ATCCTGCAGA CGTCGGCGAA CCCGTACGTC
GCGGTGCTCG GACCGCCGAA CACGGCGTCG AGCCGCTTGA ATCTGACCCA GGCGTTCAAT
TCGCTGGGAA CGACGGTGGC GCCCTACTTC GGCAAAATCT TGATTCTCGG AGCGGTGGCC
GCTCCCGTTG CCGTGGAAGT TTTCCGCAAG ATGGGTGACG CCGAGCGGCA CGCATATCAG
GTCGACCAGG CGAGCGCCAT CAAGAACCCG TATATCGGGC TGGCGATTTC GTTGTTTGTA
TTGGCTGTGA TCATGGGCTT GTTCAAGCTG CCGGTGATCC GATCGGTCGA GGGGCATGCC
GAGGCGGGAG ATTCGATCTG GCAACATAAG CAGCTGGTGC TCGGTGCAAT GGGGATTTTC
GTTTACGTTG GCGCGGAGGT TTCGATCGGC AGCTTCCTCG TGAACTACAT GCACGAACCG
AACATCGGGA ACCTTACGCT CGAAAAGGCC GCTGGATATC TTACCTACTA CTGGCTCGGC
GCGATGGTAG GACGCTTCAT AGGTGCTGCA CTCATGGGGT GGTTGAAACT TAACCCAGGC
AAGTATCTCG GTGTCAACGC ACTCTTTGCA GCCCTGCTAG TGATCGCATC GATGCTCAGT
GTCGGACACA CGGCCATGTG GGCCATGCTC GCGGTCGGCT TCTTCAATTC GATCATGTTC
CCCACCATCT TCACTTTGGG AATTGATGGT CTGGGTCACC TCACCGGAAA AGGCAGCGGC
CTCCTCATCG CGGCGATCAT CGGCGGAGCG ATTATTCCAC TGGTTCAGGG CTATTTCGCC
GATCGCATGG GCATTCACCA TGCATTCTTC TTGCCGGTGC TCTGTTACGC CTTCATCGCG
TACTACGGTT TTGTCGGCTC AAAGCATGGT CATGAGGCCG TGGCGGGTTA A
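
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence above. The sketch below assumes GC content is the plain fraction of G and C bases over the whole sequence; `gc_percent` is a hypothetical helper, not a function of the source database, and how the record rounds its value is not stated.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence as listed above (six groups of ten bases).
first_row = "ATGGCAATTT CAGCGCCGCA AAGTTCCAGC GCCAATAACA GCCAGGACGC TCCCGTAAAA"
print(f"{gc_percent(first_row):.1f}% GC in the first row")
```

Passing the full 1311 bp sequence instead of the first row gives a figure to compare against the reported value.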
 
Protein sequence
MAISAPQSSS ANNSQDAPVK TNYSAMAMVT TLFFMWGFLT CLNDILIPHL KGLFDLTYMQ 
VMLVQFAFFT SYFVFSTPSG KLIEAVGYKR AMVAGLCTMG IGTILFVPAA MVPSYPLFLT
ALVILAGGMT ILQTSANPYV AVLGPPNTAS SRLNLTQAFN SLGTTVAPYF GKILILGAVA
APVAVEVFRK MGDAERHAYQ VDQASAIKNP YIGLAISLFV LAVIMGLFKL PVIRSVEGHA
EAGDSIWQHK QLVLGAMGIF VYVGAEVSIG SFLVNYMHEP NIGNLTLEKA AGYLTYYWLG
AMVGRFIGAA LMGWLKLNPG KYLGVNALFA ALLVIASMLS VGHTAMWAML AVGFFNSIMF
PTIFTLGIDG LGHLTGKGSG LLIAAIIGGA IIPLVQGYFA DRMGIHHAFF LPVLCYAFIA
YYGFVGSKHG HEAVAG
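
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a minimal illustration, not the database's own pipeline: it builds the codon table from the NCBI-ordered amino acid string, which for table 11 has the same codon-to-residue assignments as the standard code (table 11 differs in its permitted start codons). `translate` is a hypothetical helper; it assumes the input starts in frame and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[i:i + 3]]
        if residue == "*":  # stop codon: translation ends here
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCAATTT CAGCGCCGCA AAGTTCCAGC"))  # MAISAPQSSS
# Last 21 bases of the gene sequence, ending in the TAA stop codon.
print(translate("CATGAGGCCG TGGCGGGTTA A"))  # HEAVAG
```

Both fragments reproduce the corresponding ends of the listed protein sequence, which begins with MAISAPQSSS and ends with HEAVAG.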