Gene Acid345_4307 details


Gene Information

Locus tag: Acid345_4307
Symbol:
ID: 4071880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 5117565
End bp: 5118857
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 57%
IMG OID: 637986340
Product: major facilitator transporter
Protein accession: YP_593381
Protein GI: 94971333
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
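
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5117565, 5118857)
print(length, protein_length_aa(length))  # prints: 1293 430
```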


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.622363
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGATA CGGCTCTCCA TCCAGGCGAT TACGGCGCTC CGCCGATCGG ACGCGTGCGC 
TGGACCATCT GCCTGATGCT GTTTCTCGCA ACTTCAATCA ACTACGTTGA CCGCCAGGTC
ATCGCAATCC TCAAGCCGAC ACTGGAACAG TCGATCGGGC TGACGGAAGT GCAGTACGGC
TACATCGTCG ACGCATTTCA GATCGCCTAC GGCATCGGGC TTCTTGCGGC CGGACGCTTG
ATGGACCGCA TCGGAACGCG AATCGGCTAC ATGCTGGTGA TGGCTTTCTG GAGCCTGTCA
GCCATGGGCC ATGCATTGGC GAGCACTGCG TTCGAGTTCG GCATCGCGCG ATTCTGCCTT
GGACTTGGCG AATCGGGGAA TTTCCCCGCT GCTCTCAAAA CCGTCGCGGA GTGGTTCCCT
CAGAGTGAGC GTTCGCTGGC GACAGGAATT TTCAATTCCG GCGCAAACGT CGGCGCGGTT
CTCGCGCCGT TGATCGTTCC ATGGATAACC CTCCGGTACG GATGGCACGC GGCATTTCTT
GCTACAGGTG TATTCAGCGC GATATGGATC GTGTGGTGGT ACGCGCGCTT TCGCCAACCG
AAGGAGCACC CTAAGCTTGG CGACGCTGAA CTCCAGCACA TCTATAAAGA CGCAGCGGTG
GAGATGGGCC CGCAGGTTCC GTGGGGACGA CTATGGGGCC AACGCCAGAC ATGGGCGTTC
GGGCTTGCGA AGTTTTTTAC CGATCCCATC TGGTACTTCT ATCTCTTCTG GTTGCCTTCG
TACTTCAGCG CGAAGTTTCA TTTGAACCTC TCGCATATCG GACTACCTCT GATCATCATC
TATAACGTGT CGGCGGTCGG AAGCATCGCG GGAGGGTGGC TGCCTGCTCC GTTCCGCAAA
TTGGGATTCA CGCAACAACG CGCGCGACTC TCGGCTATGC TGGTTTGCGC GATTTTGGTA
GTGCCAATTT TTATCGCGAG CTCCGTGAAC TCAGTTTGGA TTGCGATCGC GTTGATCAGT
GTGGCTGCCG GCGCGCACCA GGGCTGGTCG GCAAACTTGT TCACCACGCC GTCAGACATG
TTTCCGCGGA GTGCAGTCGG CTCGGTGGTC GGCATCGGCA ATATGATCGG GTCAATCGGG
AGTGCCATCT TCGCGTTCTA CGCCGGACAC GTCCTGCAAC TCACCCATAG CTACGCGAGT
TTGTTTACCA TCGCCGCAAG CGCGTATCTT GTTGGACTGA TGATTTTGTA TTTTCTTTCA
TCGGGGTTGC GTTCAGCGGA GATCGCCGCA TGA
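
The gene sequence is displayed in blocks of ten bases separated by spaces, so it needs to be joined before any computation. The following is a minimal sketch of how a GC content figure such as the one listed in this record could be recomputed from that display. The helper names are hypothetical, and the database's exact rounding is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed row of the gene sequence; paste all rows for the full value.
first_row = "ATGGCAGATA CGGCTCTCCA TCCAGGCGAT TACGGCGCTC CGCCGATCGG ACGCGTGCGC"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the complete sequence text, line breaks included, gives the whole-gene value, since all whitespace is discarded before counting.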
 
Protein sequence
MADTALHPGD YGAPPIGRVR WTICLMLFLA TSINYVDRQV IAILKPTLEQ SIGLTEVQYG 
YIVDAFQIAY GIGLLAAGRL MDRIGTRIGY MLVMAFWSLS AMGHALASTA FEFGIARFCL
GLGESGNFPA ALKTVAEWFP QSERSLATGI FNSGANVGAV LAPLIVPWIT LRYGWHAAFL
ATGVFSAIWI VWWYARFRQP KEHPKLGDAE LQHIYKDAAV EMGPQVPWGR LWGQRQTWAF
GLAKFFTDPI WYFYLFWLPS YFSAKFHLNL SHIGLPLIII YNVSAVGSIA GGWLPAPFRK
LGFTQQRARL SAMLVCAILV VPIFIASSVN SVWIAIALIS VAAGAHQGWS ANLFTTPSDM
FPRSAVGSVV GIGNMIGSIG SAIFAFYAGH VLQLTHSYAS LFTIAASAYL VGLMILYFLS
SGLRSAEIAA
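
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI ordering of bases (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may act as start codons. As a simplification, alternative start codons are not converted to methionine here, which does not matter for this gene because it begins with ATG. The function name is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(raw: str) -> str:
    """Translate a block-formatted CDS, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate_cds("ATGGCAGATA CGGCTCTCCA TCCAGGCGAT"))  # prints: MADTALHPGD
```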