Gene Acid345_2643 details


Gene Information

Locus tag: Acid345_2643
Symbol:
ID: 4072052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3117398
End bp: 3118597
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 60%
IMG OID: 637984660
Product: major facilitator transporter
Protein accession: YP_591718
Protein GI: 94969670
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
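
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is only an illustrative sketch: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue to the protein. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 3117398
end_bp = 3118597
gene_length_bp = 1200
protein_length_aa = 399

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it adds no residue.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)
print(expected_protein_length == protein_length_aa)
```

Under these assumptions both checks agree with the listed values: the span is 1200 bp, and 400 codons minus one stop codon gives 399 residues.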


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.238963
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTTCAC TTCCGGCCTT CCGCACGATC TGGATGGGGC AATTCGTCAG CATCTTTGGC 
GATTTCGTCG CGATCTTCGC AGTCGTCAGC ATGATTACGT TTCGCTGGCA CGGCTCGGCG
ACGCAAGTGA CCATTGCGAT TACGCTCGCC CTCGTGCCGC TGGCAATAAT CGGGCCTCCG
GCGGGCGTTC TCGTGGACCA CTGGAACGTG AAGCGGGTGA TGATTGGCAG CGACCTGTCG
CGAGCGGTGA TCGCCGTCCT TCTGGCGTGG ATGACGAACT CCTGGCAGAT CGGCATGGTG
TTGTTCGCAC TGGGTTGCGC GTCGAGTCTG TTCGTGCCGG CGCAGTCGAT TGCGGTGCGG
ACGCTGATTC CGCGCGAGCG CTTGTTGCAA GCGAATGCCA TGCTGATGCA GGCGTTTTAC
CTGATCCGCA TTATTTCGCC GCTGGTAGCG GCGGCTATTG TGAGCGCGCT CTCGGAGAAG
GCGTGTTTCT ACATTGATTC GGGAAGCTTC GTGTTCTCGG CGCTAATGAT TGCGACGCTA
ACGATTCATC GTCCACCGCG CGAGGGCGCG GATAAGACAC TAAGCGGTTT GAGCAAGGAT
TTCGCGGAAG GGAACCGGTT CATCTTCACC CATCCGGGAC TGTCGTTTGT GTTCCTCGCC
ATGGCGATTG CGATGTTCGT GATGAGCTCG TTCATGCCGC TGATATCGAT CTTTGTGCGC
GATGTGCTGC ACGGAGCAAC GCGGACCTAC GGCGTGGTGA GCTCGTGCGT CGGGTTCGGC
ATGATTCTCG GCACCACGCT GGTGACGAAG ATCTCGCGAG GCAAGGCGCG GCCGGGCGTG
GTGGTGATGG GCCTGCTATC GCTGGGAGTA GCGACGGCGG TGCTGGCGAC TTCGCGAATT
CCGTTCCAGG CGGGCGTGAG CACTTTTCTC ATGGGCTTCT CGATTGCAAT GGTGCTCATT
CCGGCGCAGA CGATGTCGCA ACAGGAAACG CCGCCGCAGA TGGTAGGCCG CGTGAGCAGC
ACGTTCATGT CGATGATCTC GATTGCGCAG GTGTTCGGGT TGCTCCTTAG CGGCTCGGCG
GCGCAACGGC TCGGGATGCA GCGGCTGTTT GCCGTGTGTT CGGCAGTACT GGTGGTCATT
GCCGCGGCAG GCTGGATGTT TTTACGCAGC CGCCGGCAAC CGGAAGCAGC AGCTGCTTAG
 
Protein sequence
MFSLPAFRTI WMGQFVSIFG DFVAIFAVVS MITFRWHGSA TQVTIAITLA LVPLAIIGPP 
AGVLVDHWNV KRVMIGSDLS RAVIAVLLAW MTNSWQIGMV LFALGCASSL FVPAQSIAVR
TLIPRERLLQ ANAMLMQAFY LIRIISPLVA AAIVSALSEK ACFYIDSGSF VFSALMIATL
TIHRPPREGA DKTLSGLSKD FAEGNRFIFT HPGLSFVFLA MAIAMFVMSS FMPLISIFVR
DVLHGATRTY GVVSSCVGFG MILGTTLVTK ISRGKARPGV VVMGLLSLGV ATAVLATSRI
PFQAGVSTFL MGFSIAMVLI PAQTMSQQET PPQMVGRVSS TFMSMISIAQ VFGLLLSGSA
AQRLGMQRLF AVCSAVLVVI AAAGWMFLRS RRQPEAAAA
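
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any calculation. The sketch below shows one way to strip the whitespace and compute a GC fraction. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here to keep the example short.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence shown above (60 of the 1200 bases).
first_line = "GTGTTTTCAC TTCCGGCCTT CCGCACGATC TGGATGGGGC AATTCGTCAG CATCTTTGGC"

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the complete gene sequence to the same two functions would give a whole-gene figure that can be compared with the GC content reported in the Gene Information section.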