Gene Acid345_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3034 
Symbol 
ID4071941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3600303 
End bp3601601 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content59% 
IMG OID637985053 
Productmajor facilitator transporter 
Protein accessionYP_592109 
Protein GI94970061 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.201695 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATCG CGGCTCTCTC CATGACAACC ATACCGTCAG AGCACAAGCG GCTCCCGGGG 
TTTCTTATTG TCCTGCTGGT GTTGATTGGC GTCAGCGTTT TTATCAACTA CATCGACCGC
GGCAACCTGT CGATTGCAGC TTCGATGGTG CAGGACGAAA TGCATATCAA CCCCGCGCAA
CTGGGCGTGC TGCTTTCGGC CTTCTTCTGG ACCTACGCTT TGCTACAGCC TTTGTATGGC
TGGCTGGCGG ACCGCGTAAA TGTGTATTAC TTGTTCGCGG TCTGTTTCGC GGCGTGGTCC
GTGGCCACAG CGGCAACCGG CCTGGTGCAC ACGTTCGTTG CCTTATTCGC GTTGCGGCTG
ATTGTCGGGA TGGGGGAAGC GGTGTCGTTC CCGGCGTACT CGAAGATCAT TGCCTTGAAT
TATCCAGAAG AACATCGCGG CGTGGCGAAC AGCGTGCTCG CCATGGGGTT GGCGGTTGGT
CCGGGATTCG GGATACTGCT CGGCGGCACC CTGATGGCGC GGTTCGGATG GCGGCCATTC
TTCATCATCC TCGGCTTGGG CAGCATGCTC TGGATTCCTC TGTGGTTAAA GTGGTCGCCG
AGCAGGAACC TCGTTCCTGC ATCCAGCAAA CAATCTTCGC CAAGCTTGCT GGAGTTTGTC
TGCCTGCGTT CGGCGTGGGG AAGTTGTATC GGACTGTTTT GCGGCAACTA TGTGAACTAC
TTCCTGCTCA CCTGGCTGCC GTACTACCTG TTGCGCGAGC GGCATTTTTC GATGGCGCAG
ATGGCGCGCA TCGGAGCAAC CGGTTACTTC GGTGGAGCGG TGTGTGCGGG AATCGCGGGT
TGGCTCTCGG ACCGGTGGAT ACGGTCCGGT GCAACGACGA CCGTTGTCCG AAAAACCTTC
GTTGCCGGCG GCTATGTGTC GTGTGCCACC TTTCTCGCTC TCGCTGCCTT CGTTCCGGGC
GCGCGCGGAT CCACGATTCT GTTGTGGCTA GCCATGGCGA GCTTCGGTGT GAGCGCCTCC
AATATCTGGG CGATCCCGCA GACGCTCGCG GGCTCGCAAG CCGCTGGACG CTGGGTGGGA
TTCCAGAATT GCTCTGGAAA TATGGCTGGC GTCGTAGTGC CTGTAGTAAC TGGATTCGTG
GTGCGACAGA CGGGAAGCTT CCGCTCGGCC TTCGTATCGG TGGCAGTCGT TCTGTGCATC
GGCGCAGCTA CATGGACGTT CGTCGTAGGG AAGATCGAAC AGGTGAAATG GGAACGATCG
GCGGAATTAC TCGTGGCCGA AGCTTCAGCG TCGCGTTGA
 
Protein sequence
MRIAALSMTT IPSEHKRLPG FLIVLLVLIG VSVFINYIDR GNLSIAASMV QDEMHINPAQ 
LGVLLSAFFW TYALLQPLYG WLADRVNVYY LFAVCFAAWS VATAATGLVH TFVALFALRL
IVGMGEAVSF PAYSKIIALN YPEEHRGVAN SVLAMGLAVG PGFGILLGGT LMARFGWRPF
FIILGLGSML WIPLWLKWSP SRNLVPASSK QSSPSLLEFV CLRSAWGSCI GLFCGNYVNY
FLLTWLPYYL LRERHFSMAQ MARIGATGYF GGAVCAGIAG WLSDRWIRSG ATTTVVRKTF
VAGGYVSCAT FLALAAFVPG ARGSTILLWL AMASFGVSAS NIWAIPQTLA GSQAAGRWVG
FQNCSGNMAG VVVPVVTGFV VRQTGSFRSA FVSVAVVLCI GAATWTFVVG KIEQVKWERS
AELLVAEASA SR