Gene Acid345_1540 details

Gene Information

Locus tag: Acid345_1540
Symbol:
ID: 4072931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1882877
End bp: 1883938
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 54%
IMG OID: 637983549
Product: ABC spermidine/putrescine transporter, periplasmic ligand binding protein
Protein accession: YP_590616
Protein GI: 94968568
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID:
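
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the unspliced CDS is a stop codon. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1882877, 1883938)
print(length, protein_length(length))  # prints: 1062 353
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.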


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.713242
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.652305
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAAAC GCTTTGCCCT CGCGCTGCTG GTTGTCTGTA CTTTGTTTCT CGGCTCATGC 
TCGAAGAAAG TGCCGACGCT CAACCTTCTC GTATGGGAGG GCTATGCCGA TCCCTCTTTC
GTGAAGGGCT TCGAAGAGAA ATATCACTGC AAGGTCGCAG CAAGCTACAT GGGGTCAAGC
GATGAACTCG TCGCCAAGCT GCGTGGCGGT AGCGCGTCGA ACTACGATGT AATCTCGCCG
TCCAGTGACG TCGCGACTAG TATCGCGAAG AATGGTCTCG CCGCTGAAAT CGACACTACG
CAAATCCCCG ACTACAACAG TCTCTCGCAG CAGCTTCGCG ATCTCCCTCT CGTCAAAGCA
AATGGCAAGA CCTATGGAGT CCCATTCATG TGGGGCCCGA ATCCTCTGCT TTACGACACT
TCTTATTACA AGACGGCGCC GCAAAGCTGG GCTGACTTGT GGGACCCAAA GCTGAAGTCG
AAGATCTCCG TCTGGGATGA TCTCTCCACC GTCTATATGG CGGCACAAGT GCTCGGCTAC
GACAAGCCCG ATCCAGCGCA CCTCTACAAT CTCACCGACG ACGAACTCGA AAAAGTCAAA
GCAAAGCTTC TCGAATTAAA GCCAAACGTT CGCAAGATGT GGTCTACCGG TGGGGAACTG
ACCAACCTTT TCCAGAACCA CGAGGTCGTC GCCGCGATGG GCTGGCCGCT GATGACGAAT
CAACTTCATA AGGCCAACTT TCCAATCGGC GAAACCATTC CGAAGGAAAA CACGACCGGC
TGGATCGACC ATTTGATGAT CACCGCCGCC AGCGACAACA AAGATCTCGC GATGAAATTC
CTGGCATACA TGGTCGAAGC GAAAACGCAG AAGGCCGTTA CCGACGTGAC CGGCTACACA
CCGGCCAATC CCACTGCCGC GCAACTCATG ACCGACCAGG AGAAGAAGAG CCTTCACCTC
GCTGATGTCG AGGGCTACCA GAAGCACATC TATTTCTGGC AGGATGTTCC GCGCCGTGCG
AAGTACAACG AAATCTGGAA CCAGGTGAAA GCCGCGCAAT AG
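
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before the sequence can be analyzed. The following is a minimal sketch, not the method used to produce the GC content field in this record: the helpers `clean_sequence` and `gc_percent` are hypothetical names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence above, as printed (six blocks of 10 bases).
first_row = "ATGTCCAAAC GCTTTGCCCT CGCGCTGCTG GTTGTCTGTA CTTTGTTTCT CGGCTCATGC"
seq = clean_sequence(first_row)
print(len(seq))  # prints: 60
print(round(gc_percent(seq), 1))
```

Passing the full 18-row sequence through the same two functions would give a gene-wide GC percentage that can be compared with the rounded value in the Gene Information section.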
 
Protein sequence
MSKRFALALL VVCTLFLGSC SKKVPTLNLL VWEGYADPSF VKGFEEKYHC KVAASYMGSS 
DELVAKLRGG SASNYDVISP SSDVATSIAK NGLAAEIDTT QIPDYNSLSQ QLRDLPLVKA
NGKTYGVPFM WGPNPLLYDT SYYKTAPQSW ADLWDPKLKS KISVWDDLST VYMAAQVLGY
DKPDPAHLYN LTDDELEKVK AKLLELKPNV RKMWSTGGEL TNLFQNHEVV AAMGWPLMTN
QLHKANFPIG ETIPKENTTG WIDHLMITAA SDNKDLAMKF LAYMVEAKTQ KAVTDVTGYT
PANPTAAQLM TDQEKKSLHL ADVEGYQKHI YFWQDVPRRA KYNEIWNQVK AAQ