Gene Acid345_0324 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0324 
Symbol 
ID4068601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp350987 
End bp352690 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content58% 
IMG OID637982327 
ProductSSS family solute/sodium (Na+) symporter 
Protein accessionYP_589403 
Protein GI94967355 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.549745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCAC TCCTTGCGAG TTTCGTAGCG GACCGGTTGG TGAACCTCTC AAGCACCGAC 
CTTGTCATCA TCGTGTTCTA CTTCGCCCTC GTCCTCGCCA TTGGATGGCA GCTGAAGGGC
CAAGCCAAGA CCGGCGAAGA CTTCTTCATG GCCGGACGCG AGATGACCGC GTGGATCGCC
GGCCTTAGTT TCCTCTCCGC AAACCTCGGC TCTCTCGAAC TGATGGGCTG GGCCGGGGCC
GCCTACCAAT ACGGCATCCT CGCCGCCCAC TGGTACTGGA TCGGCGCCAT TCCCGCCATG
ATCTTCCTTG GTCTTGTGAT GATGCCGTTC TATTACATCT CCAAGACGCA CTCGGTGCCG
GGATACTTGA AGCTGCGCTT CGGTGAACCC AGTCGGGCAC TGTCCGCCGT TTCGTTCGCG
TTCATGACGG TGCTGATGAG TGGCATCAAC ATGTATTCGA TGGCGCTGGT GATGAAGGTC
GTTCTCGGCT GGGACATCAA CGTCAGCATC ATAGTTTCGT CCATCACGGT CGCCATCTAC
GTCACCCTCG GCGGCCTGCG TTCCGCCATC TTCAACGAAG TCTTGCAGTT CATTTTGATA
TGGGCAGGTG CGCTTCTCAT CCCCATCATG GGCCTGATCG AAGCCGGCGG CTGGACGAAC
CTCAAGGCGC AAATCACGCG CAATGCCTCA GCGGAGTACA CGCACCTCTG GTCCACGCTC
GGGAAATTCA GCGACAACCC GATGGGCATC AATTGGATCG GCATCGTCTT CGGACTGGGC
GCGATTATTT CGATGGGCTA CTGGACGACG GATTTCCTGG TGGTGCAGCG CGTACTAGCC
GCGAAGGACA TGCGTTCGGC GAAGATGGCC CCGATCATTG GCGCCGCCTT CAAGATGTTG
GTCCCCTTCA TCGTCATTCT GCCCGGACTG CTCGCCCTCG CCGTCCTGCC CATGAAGCTT
GTAGGCGAAA GCCAAGCTAT TGCCACCCAC GGACACAGCT ACAACGAAGT TCTGCCCTTG
ATGCTCGCGC GCTACTGCGG CCCCGGCCTC CTCGGCCTCG GTATCACCGC GCTCATCGCT
GGATTCATGT CCGGCATGGC GGGTAATGTC AGCGCTTTTA CCACGGTGTG GACGTATGAC
ATCTACCGCG CGATGATCAA AAAGGATGCG AGCGATGCCC ATTACGTAAA CATGGGACGT
GCATCAACGA TTGGCGGCGT AATCATCAGC ATTCTCACTG CATACTTCGT GATGAAATTC
GCCAGCATCA TGGATTACGT GCAAGCGCTC TTCAGCTTCT TCATCGCTCC GCTTTTCGCA
ACCGTCGTTC TCGGCATGCT CTGGAAGAGG GCCACGAATG CAGGTGGCTT CTGGGGATTG
CTTGCGGGTA CCGTCTCGTC GGTCGGCATG TACGCCTGGG TGAAGCTCGA TCCCTCGGCG
CTGCGCTACG TCGCAATGTC ACCCGACGCA CAGGCGATGG CGGAGAACAT GTATCGCGCC
CTGTGGTCGT GTCTCATCTG CGCGCTGGTC ACGGTGGTGG TGAGCTACGC CACGAAGCCG
AGGCCCGACG CGGAACTGGT CGGGCTCGTG AAGTCAGTGA CACCAATTCC AAGCGAAGGC
GATGTGCCGA TGTACATGCG TCCTGCGTTC TGGGCCTGCG TAGTCGCCGT CGGATTCATC
ATTCTTCAAA TCATCTTCTG GTAG
 
Protein sequence
MSPLLASFVA DRLVNLSSTD LVIIVFYFAL VLAIGWQLKG QAKTGEDFFM AGREMTAWIA 
GLSFLSANLG SLELMGWAGA AYQYGILAAH WYWIGAIPAM IFLGLVMMPF YYISKTHSVP
GYLKLRFGEP SRALSAVSFA FMTVLMSGIN MYSMALVMKV VLGWDINVSI IVSSITVAIY
VTLGGLRSAI FNEVLQFILI WAGALLIPIM GLIEAGGWTN LKAQITRNAS AEYTHLWSTL
GKFSDNPMGI NWIGIVFGLG AIISMGYWTT DFLVVQRVLA AKDMRSAKMA PIIGAAFKML
VPFIVILPGL LALAVLPMKL VGESQAIATH GHSYNEVLPL MLARYCGPGL LGLGITALIA
GFMSGMAGNV SAFTTVWTYD IYRAMIKKDA SDAHYVNMGR ASTIGGVIIS ILTAYFVMKF
ASIMDYVQAL FSFFIAPLFA TVVLGMLWKR ATNAGGFWGL LAGTVSSVGM YAWVKLDPSA
LRYVAMSPDA QAMAENMYRA LWSCLICALV TVVVSYATKP RPDAELVGLV KSVTPIPSEG
DVPMYMRPAF WACVVAVGFI ILQIIFW