Gene Acid345_2910 details


Gene Information

Locus tag: Acid345_2910
Symbol:
ID: 4071211
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3455259
End bp: 3456584
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 60%
IMG OID: 637984928
Product: major facilitator transporter
Protein accession: YP_591985
Protein GI: 94969937
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
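
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3455259
end_bp = 3456584

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1326 441
```

Both values agree with the Gene Length and Protein Length fields listed in the record.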


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGTTG TGGCGCAACG TTCCATAAAC CGCCCGCGGC CCATTCCATC GCTGCGCTGG 
TGGATAGGCG GGCTGCTGTT CGTTTCCACG GCGATCAACT ACCTCGACCG TCAGACCCTC
TCGCTTCTCG CGCCTTACCT GAAATTGGAG TACCACTGGA CGAACACCGA CTACGCAAAC
ATCGTGATCG CGTTCCGTAT TGCATACACT ATCGGCCAAA CCGCGTGCGG CCGCTTTGTG
GATAAAGTCG GCACCCGACG CGGCCTCTCC ATCACGGTTG CGTGGTATTC GCTGGTATCG
CTGCTGACTT CCTTCGCCCG CGGCTTCGCG AGCTTCGCGG GGTTCCGCTT CTTATTAGGA
GCAGGCGAGG CCGCGAATTG GCCGGCCGCG ACCAAGGCCG TGTCTGAATG GTTTCCCGAG
AAAGAACGCG CCTTGGCAAC CGCCCTGTTC GACAGCGGAT CGTCCATCGG CAGCGCGATT
GCGCCTTTCG TTGTGCTTTC CATCTACTTC CGGTGGGGAT GGCGTCCGGC GTTCATGATT
CCCGGCGCGC TCGGCTTTGT CTGGCTCATC GCGTGGCGAT ATTTGTACCA TCCACCGCAG
CAACACCCTC GCGTGAGCGC AGCCGAGTTG CAGTACATCA CCGCCCCGGC GGGTGAGAGC
GCAGCATCAA CGGACTTGCG CTCGCCACGC TGGCGCGAAT TGCTGAAGTT CCCGCAGACA
TGGGCGATCA TTGTCGCCAA GAGCTTCACC GATCCCGTTT GGTTCTTCGT CACCGATTGG
TTCCCGATCT ACCTGGTCGC GAAAGGCATC ACTCTGAAGA GCAGCCTGGT TGCGATTTGG
ATTCCGTTCC TCGCAGCCGA CCTCGGCAAT TTTTTCGGCG GCGGAGTTTC GGGTTATCTC
ATCCAGCGCG GATGGACTGT AGGCAAGGCC CGAAAAGCAC TCGTCGTCTT CGGCGGAATC
GGCGTCCTCG CGCTCATTCC AACCGTCCTC ACGCAAAGTC TATTTGCGAT CTCCGGATTG
TTCGCGATCG CCACCTTCTC ATACTCCGTG TTCTCCACGA TGGCAATCGT CCTGCCGTCC
GACGTCTTCC ACAGCGACTC GGTTGCGACT GTCAGCGGCT TCAGCGGTTC GGGCGCAGGA
ATCGGCACCA TCATCGCCTT CGAACTGGTC GGCCACTTCT CCGACGCGCG GAGCGCGCAG
GGCGTCCACT CCTTCGATCC AATCCTGATT GTTGCCGGGA TGGTTCCCTT CGTCGGGATG
CTGCTGGTGC TGGCTCTGCT GCGCAACAGC AAAGCAACGG AAGAGGGGTA CGCACGGCCG
ATATAA
 
Protein sequence
MSVVAQRSIN RPRPIPSLRW WIGGLLFVST AINYLDRQTL SLLAPYLKLE YHWTNTDYAN 
IVIAFRIAYT IGQTACGRFV DKVGTRRGLS ITVAWYSLVS LLTSFARGFA SFAGFRFLLG
AGEAANWPAA TKAVSEWFPE KERALATALF DSGSSIGSAI APFVVLSIYF RWGWRPAFMI
PGALGFVWLI AWRYLYHPPQ QHPRVSAAEL QYITAPAGES AASTDLRSPR WRELLKFPQT
WAIIVAKSFT DPVWFFVTDW FPIYLVAKGI TLKSSLVAIW IPFLAADLGN FFGGGVSGYL
IQRGWTVGKA RKALVVFGGI GVLALIPTVL TQSLFAISGL FAIATFSYSV FSTMAIVLPS
DVFHSDSVAT VSGFSGSGAG IGTIIAFELV GHFSDARSAQ GVHSFDPILI VAGMVPFVGM
LLVLALLRNS KATEEGYARP I
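
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the database's own method: it uses the standard amino-acid assignments (which translation table 11 shares with the standard code; the two differ in permitted start codons), and the helper names `clean` and `translate` are hypothetical. The sequence is displayed in 10-base blocks, so whitespace is stripped before translation.

```python
from itertools import product

# Codon table built in TCAG order; these amino-acid assignments are the
# standard ones, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display (hypothetical helper)."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAGCGTTG TGGCGCAACG TTCCATAAAC"
print(translate(first_blocks))  # prints: MSVVAQRSIN
```

The output matches the first ten residues of the protein sequence, and passing the final block `ATATAA` yields `I`, the last residue, followed by the TAA stop codon.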