Gene Paes_0520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_0520 
Symbol 
ID6459326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp563355 
End bp564992 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content52% 
IMG OID642724519 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002015223 
Protein GI194333363 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTGGGG ATGCCAACTA TCTCAACCCC GTCATTGGCG CTTCGGTAAC GTCCGGCAAC 
GTCTACGGCC TGATCTATCC CGGTCTGATA CAGAGCGATT TCGATACCAC TACCGGTCTG
TTGAATTTTA TTGCCGTCGA AAAAAGTCTC CTTCCTCTGA CCAATTCCGG ATTGAAGGAT
CAAAAGCGTT CTGCAATAGC CAGGTCATGG TCCATGGCCG ACGACAACCT TTCGATCACC
TATACGCTTC GTGACGATAT TACCTGGAAT GACGGAACAC CTCTCACTGC TCATGATTTC
AGGTTTTCCT ATGAACTCTA TGGCAACCCT GTCATTGCCA GTCCCCGTCA GCAGTACCTT
GCCGAGCTGG TTGGTGCCGA TAAGGGAGCG ATAGATTTTG ACAGGGCCAT TGTCGTCCCC
GACGATACAA CACTGGTGTT TCATTTTTAC AAACCGGTTT CCGAACGCCT GGCACTGTTT
CATACTTCGC TGACACCGGT ACCCAAGCAT ATCTGGGAGG CGGTGCCTCC TGCAGAGTTT
CGCCAGTCAG GTCTCAATCA GACCCCTCTG GGCGCCGGAC CCTACACCTT GACATCGTGG
CAGAAACAGC AGCAGATCCT TCTGGAATCA AATCCGCTTT GTCGGCTTCC CAAGCCCGGT
AACATCAAGC GAATCATGTT TCGCATCGTT CCTGATTATA CCGTTCGTCT GACGCAGCTT
CAGACAGGGG TTGTCGACGT TGTCGAAAAT ATCAAACCCG AGGATTTCAG CGGCCTGGAA
TCAGCTTCGG CCGACATCGA CATCAAGTCT GTCGGGCTGA GAGTCTATGA CTATGTCGGG
TGGTCCAACA TCGATCAGGA AGCCTATCAC CGGGACGGAT CGCTTCTGCC TCATCCGCTG
TTCGGTTCGG CAGACGTCCG TCGCGCTCTT ACGCTGGCAA TCGATCGCCA GAGTATCATC
GACGGCTACC TTGGCCCATA CGGGGTCATC TGCAGTTCAG ATATCTCTCC ATCCATACGA
TGGGCATACA ACAACGACGT CAGGCCCTAC GGGTACGATC CCAATGAGGC CGTGCGCCTT
CTTGAGCAGG AGGGCTGGGT TCCCGGCCCT GACGGCATCC GTCAGAAAGA CGGCAGGAAG
TTCAGTTTTG CCCTCTATAC CAACGCAGGT AACTCCAGGA GAAATTTCGC CAGCGTTATT
ATTCAGCAGA ACCTTCGTGA AATAGGTATT GAGTGCCGTC TTGAGGTTCA GGAGTCAAAT
GTGTTTTTTG AAAATCTCCG TCTGAGAAAG CTCGATGCCT GGATGGCAGG ATGGTCGATA
GGGCTCGAAA TCGACCCGCT CGACGGATGG GGATCGGATC TTGAAAAGAG CCGCTTCAAT
TTCACCGGTT ATCGCAATTC CCGTATCGAC GAACTCTGCA TGCTGGCCAA AGAGGAGCTT
GACCCGGTTG ACGCCAGGCC CTACTGGATG GAGTATCAGG AAATTCTCCA TAGGGACCAG
CCGACCACTT TTCTTTACTG GATCAAAGAA ACGCAGGGTT TTAACCGCCG CGTCGAGGGA
GAAGAGGTCA ACATTCTGAG CACCTTCTAT AATATCGATG ACTGGACATT GAATCCGTCC
GCAACCGTTT CGCAATAA
 
Protein sequence
MLGDANYLNP VIGASVTSGN VYGLIYPGLI QSDFDTTTGL LNFIAVEKSL LPLTNSGLKD 
QKRSAIARSW SMADDNLSIT YTLRDDITWN DGTPLTAHDF RFSYELYGNP VIASPRQQYL
AELVGADKGA IDFDRAIVVP DDTTLVFHFY KPVSERLALF HTSLTPVPKH IWEAVPPAEF
RQSGLNQTPL GAGPYTLTSW QKQQQILLES NPLCRLPKPG NIKRIMFRIV PDYTVRLTQL
QTGVVDVVEN IKPEDFSGLE SASADIDIKS VGLRVYDYVG WSNIDQEAYH RDGSLLPHPL
FGSADVRRAL TLAIDRQSII DGYLGPYGVI CSSDISPSIR WAYNNDVRPY GYDPNEAVRL
LEQEGWVPGP DGIRQKDGRK FSFALYTNAG NSRRNFASVI IQQNLREIGI ECRLEVQESN
VFFENLRLRK LDAWMAGWSI GLEIDPLDGW GSDLEKSRFN FTGYRNSRID ELCMLAKEEL
DPVDARPYWM EYQEILHRDQ PTTFLYWIKE TQGFNRRVEG EEVNILSTFY NIDDWTLNPS
ATVSQ