Gene Acid345_3056 details


Gene Information

Locus tag: Acid345_3056
Symbol:
ID: 4071963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3631655
End bp: 3632911
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 58%
IMG OID: 637985075
Product: Na+ dependent nucleoside transporter
Protein accession: YP_592131
Protein GI: 94970083
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG1972] Nucleoside permease
TIGRFAM ID: [TIGR00804] nucleoside transporter
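
The coordinates and lengths listed above are consistent with each other. The short Python sketch below reproduces the arithmetic; it assumes the start and end positions are 1-based and inclusive, which the record does not state explicitly but which matches the listed gene length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 3631655
end_bp = 3632911

# Assumption: both coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the final codon is the stop codon
# and does not encode a residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1257 0 418
```

The result agrees with the listed gene length of 1257 bp and protein length of 418 aa.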


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCGCC TGACCGGTGT TCTCGGTCTG CTGGTTTTTC TGAGCCTGGC TTATCTCTTC 
TCGACCAACC GTCGCGCGAT CAAGTGGCGG ACGGTGATCA TCGGCCTTAT TCTCCAAATA
CTGTTTGCGA TCTTTGTGCT GCGGGTGCCG ATCGGGCAAC GCATCATGCA AATAGGCGGC
GACGGCGCTA AGAAGTTGCT GTCGTTCTCG TTCGCCGGAT CGAGCTTTGT GTTTGGTGAT
CTTGGGGCGA GCGGCGGGAA GTACGGGTTC TTCTTTGCGT TCCAGGTGTT GCCGGTCATT
ATTTTTATCG CGGCGTTCTT CGCGATCCTC TATCACTACG GAATTATGCA GTTCATCATT
CGCAATGTTG CGAAGGTGAT GATGCGCTTT ATGGGCGCGA GCGGCGCCGA GTCGCTGAAT
GTGGCTGCCA GCATTTTCAT GGGGCAAACG GAAGCGCCGC TGACGATCCG TCCGTTCCTG
CCAAAACTGA CCCAGAGCGA ACTAATGGTG GTGATGACGA GCGGCATGGC GCATGTGTCT
GGGGCGATCA TGGGCGCGTA CATCCTGCAG GGGATTGAGG CGAAGCACAT CCTCGCGGCG
GTGATCATGA CGGCACCGGG AACATTCGTG ATCGCCAAGA TGCTGGTGCC GGAAACAGAG
ACACCACTAA CCGCGGGACG CCTGGAGGCG ACGACCGAAG AGGAACTCAC AGGGGAAGAG
AAGCACGCGA ACGTACTGGG CGCAGCGGCG AAGGGAACGA CCGACGGATT GTGGCTGGCG
CTAAACGTGG GGGCGATGTT GATCTCGTTT CTCGCGCTGA TTGCGCTGAT CAACGGCGTT
CTTGGCGGCA GCCACAACTG GCTGGCGGCG CACGGCTTCA AGTGGTTTCC CGACAAGTTG
GAGACCATCA TCGGGGCGAT TTTTGCGCCG TTCGCGTGGC TGATTGGAAT TCCTTGGCGC
GATTGCTTGA ACGTCGGGAA CCTGCTCGGC ACGCGCATGG TGCTGAATGA ACTGGTGGCT
TTCACCATGC TTGGACAACA AAAGGCGGGA CTCGATCCGC GGTCGTTCAC GATTGCGACG
TTCGCACTGT GCGGCTTCGC GAACTTGAGC TCGGTGGGTA TTCAGATCGG CGGATTGGGT
GCGTTGGCCC CGAACCGCAG AAACGACCTT GCTAGATTGG GTTTTCGCGC GATGTTGGCC
GGAACGATGG CGAACCTGAT GTCGGCGTCA ATTGTGGGGA TTCTGTTGCA TGCTTAA
 
Protein sequence
MERLTGVLGL LVFLSLAYLF STNRRAIKWR TVIIGLILQI LFAIFVLRVP IGQRIMQIGG 
DGAKKLLSFS FAGSSFVFGD LGASGGKYGF FFAFQVLPVI IFIAAFFAIL YHYGIMQFII
RNVAKVMMRF MGASGAESLN VAASIFMGQT EAPLTIRPFL PKLTQSELMV VMTSGMAHVS
GAIMGAYILQ GIEAKHILAA VIMTAPGTFV IAKMLVPETE TPLTAGRLEA TTEEELTGEE
KHANVLGAAA KGTTDGLWLA LNVGAMLISF LALIALINGV LGGSHNWLAA HGFKWFPDKL
ETIIGAIFAP FAWLIGIPWR DCLNVGNLLG TRMVLNELVA FTMLGQQKAG LDPRSFTIAT
FALCGFANLS SVGIQIGGLG ALAPNRRNDL ARLGFRAMLA GTMANLMSAS IVGILLHA
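
The sequences above are printed in space-separated blocks of ten characters. The Python sketch below shows one way to strip that layout, compute a GC fraction, and translate a coding sequence. It is an illustration under stated assumptions, not the method the database uses: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, whose sense-codon assignments translation table 11 shares (table 11 differs in the alternative start codons it permits).

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional NCBI ordering: codons enumerated with
# the first base varying slowest and the third fastest, bases as T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = "ATGGAGCGCC TGACCGGTGT TCTCGGTCTG CTGGTTTTTC TGAGCCTGGC TTATCTCTTC"
print(translate(first_row))  # prints: MERLTGVLGLLVFLSLAYLF
```

The 20 residues printed match the first two blocks of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content listed in the record.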