Gene Acid345_4206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4206 
Symbol 
ID4072165 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4983534 
End bp4985006 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content57% 
IMG OID637986237 
ProductNa+/solute symporter 
Protein accessionYP_593280 
Protein GI94971232 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.153775 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0109856 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACC TAATTTATGC CGTGGTACTG GGCATTATCG TTGTCGCGTT GCTGGCCGTC 
AGCCTCTCGC AGCTTCGCAA GGTAAAAACA AAAGCAGATT ACCTCGTCGC TGGACGATCG
CTCCCGGCAT ATGTGCTGGT GGCGACACTG CTGTCGTCGT GGATCGGTGC GGGCAGCCTG
TTCGCAGGCG CTGAAAATGC ATTCCGCAAT GGCTTCGCAG GACTATGGCA GTCTGCCGGC
GGATGGTTCG GGCTGCTGGT TATTTACTTC GTGGCACCAC GCGCCCGAAA GTTCGCGCAG
TACACCATTC CCGACCTCAT CGAAACGCGC TACAACACCA CAGCCCGCGT GCTTTCAACG
ATTGCGATCC TCTTTGCGTA CACGGCGATC ACTTCGTATC AATTCCGCGC GGGCGGGAAC
ATCCTTCACC TGATCTTCCC TGAAGTGAAC CACGAGGTCG GGACGTACAT CATCGCTGTA
TTTGTGATTG CGTTCACGGC GGTCGCAGGC ATGGCGTCCG TTGCTTACAT GGATGTGGTC
ATAGGAGTGT TGATCACGGT GATTGGATTC ATCGCTGCCC CGGTGCTGCT CAACCGCGCT
GGCGGTTGGG CCGGATTGCA CCAGGTGCTT CCCGCAACCC ACTTCACACT GCTCGGTGAG
TTCGGCTACG TAGACGGGCA ATCGCAGGGA ACGCTCGCTG GATTGCTGAA AGCCTTTGAG
TATTTCCTGC CGACGTGCCT CCTCATGCTC GGCAACCAGA GCATGTACCA GAAGTTCTTC
TCGGCAAAGT CGGAGAAAGA CGCGCGACAA GCCGTCGTCG GATGGGTCTT CGGGACGTTT
ATTCTTGAGA CCCTGATCGC TGCCATCGCT GTGTTTGGGT CGGCGATTGT GTGGGTGCAG
TATCACCAGC ACCAGGTTGA CCTGGAGCCG CATAACATCA TCCCGTATTC GGCGCTGCAC
TTTCTGCCGA GAGTTGTGGG CGCACTGCTG ATGGGAGCGG TGTTCGCGAA GGTAATCTCG
ACCGCGAATA ATTATCTCTT CTCGCCGTCG ACGAACTTGA TCAACGACAT CTACACGCGG
TTCATCAACA AAGAAGCCTC CAACAAAAAC ATTCTGTTTG TTTCTCGCTT TCTCGTCTTA
GGGTTAGGCG TGTGGGCACT GGTGCAGGCG GTGCATCTGA CGTCGGTGCT GGAGAAGGCG
ATGTATGCCT ACACGATCTA CTCGGCGGCG ATAACGCCGG TGGTGTTGGC CGCGTTCTAC
TCGAAGCGTG TGACCGCACC GGCAGCGGTC ACTTCAATTG CGCTTGGGAC AGCGGTGACG
GTGTTCTGGG ACCTCGGCAA GAACTTGCTG CCTCCGGCGC TCGCTCAGCG CGATGCAATT
TTCCCGGCAC TTGTCGTTTC GCTGCTCTCG TTGCTGCTTG TGACCCTTGT AACTCCGCCA
CCATCGAAGG AGCAACTCGC GCCTTTCTCC TAG
 
Protein sequence
MSNLIYAVVL GIIVVALLAV SLSQLRKVKT KADYLVAGRS LPAYVLVATL LSSWIGAGSL 
FAGAENAFRN GFAGLWQSAG GWFGLLVIYF VAPRARKFAQ YTIPDLIETR YNTTARVLST
IAILFAYTAI TSYQFRAGGN ILHLIFPEVN HEVGTYIIAV FVIAFTAVAG MASVAYMDVV
IGVLITVIGF IAAPVLLNRA GGWAGLHQVL PATHFTLLGE FGYVDGQSQG TLAGLLKAFE
YFLPTCLLML GNQSMYQKFF SAKSEKDARQ AVVGWVFGTF ILETLIAAIA VFGSAIVWVQ
YHQHQVDLEP HNIIPYSALH FLPRVVGALL MGAVFAKVIS TANNYLFSPS TNLINDIYTR
FINKEASNKN ILFVSRFLVL GLGVWALVQA VHLTSVLEKA MYAYTIYSAA ITPVVLAAFY
SKRVTAPAAV TSIALGTAVT VFWDLGKNLL PPALAQRDAI FPALVVSLLS LLLVTLVTPP
PSKEQLAPFS