Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccur_09750 |
Symbol | |
ID | 8375182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptobacterium curtum DSM 15641 |
Kingdom | Bacteria |
Replicon accession | NC_013170 |
Strand | - |
Start bp | 1114187 |
End bp | 1115749 |
Gene Length | 1563 bp |
Protein Length | 520 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644993897 |
Product | sodium/proline symporter |
Protein accession | YP_003151348 |
Protein GI | 256827389 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG0591] Na+/proline symporter |
TIGRFAM ID | [TIGR00813] transporter, SSS family [TIGR02121] sodium/proline symporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000003829 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 72 |
Fosmid unclonability p-value | 0.000045945 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTTAAGA GCGACTTTTG GGTATTGTTC GCAATGCTCA TCTACTTTAT TGCAGTGGTG ATCATTGGTT TCGTATACGC GAAACGTGCC AATAAATCTA CCGAGGCATA TTTTCTCGGT GGTCGTTCTT TGGGGCCTTG GGTAACAGCT TTATCGGCTG AAGCAAGCGA CATGTCTGGG TGGCTGCTCA TGGGTATACC GGGCGTGGCG TATTTTACGG GCGCATCAGA GGTCCTCTGG ACGGTTATTG GGCTGGCGAT TGGCACCTAT CTGAACTGGC TACTTGTTGC CCGACGACTG CGAGTGTATT CAGAAGTTGC GGGTAACGCA ATTACGCTGC CGAACTTCTT TTCAAACCGT TTTCACGATC ACAAGAACAC TCTGATGACC ATTGCGGCGC TGATCATTAT CGTATTCTTC AGCATTTATG TCGGCAGCTG TTTTGTGGCA GTTGGCAAGC TGTTCACCAC GCTATTCGGT CTTGACTACG CCACGATGAT GGTGCTCGGT GCGCTTATTG TGTTCTTGTA TACTTTCGTT GGGGGGTACC TTTCTGTTTC CACAACCGAC CTCGTGCAGG GCACTATCAT GATATGTGCA CTTGCCATTG TTTTTGTGGG CTCGGTTGTG CAGGCGGGTG GTGTTGAAAA CACGGTGGCC TTTCTTAAAG AAATACCCGG TTTTCTTTCG GGGACGCAAA CAGCCTCTCC CATACTTGAT ACAAATGGCT TACAGCAGGT AGTTGATTCC ATTCCGCAGT TCGGCGATGC AAAGGACTAC GGACTCATTA CGATTGTGTC CGGACTGGTA TGGGGTTTAG GGTATTTCGG CATGCCGCAG GTGCTCGTGC GGTTTATGAG TGCTCGAAGT GCTGACGATA TTAAGATGTC ACGTCGCATT GCAACGGTAT GGTGCGTGGT TTCCATGGCG TGCGCGCTTT GCATCGGGCT TGTTGGTCGT GCGGTACTGC CAGGTGTTTT TCTGACGCAA TCGGCTGCCG AGAGTATCTT TATTATTCTG TCGCAGATGA TTCTTCCCTC ATTTATCTGT GGGTTAGTGG TATCGGGCAT TTTTGCGGCA TCGATGTCAT CATCTTCGTC GTATTTGCTG ATATCGGGTT CGGCCGTCGC GGAAAATATC TATCGTGGGC TTATTCGTCG CGATGCTACT GACACACAAG TGATGATCGT TGCCCGTATT ACGCTTGTGG TTGTATTGCT GTTTGGTATT GTGATTGCAC TCGACCAGGA TTCATCGATC TTCCAGGTGG TGTCGTATGC GTGGGCTGGT TTTGGTGCTT CCTTTGGTCC GTTAATGCTG GCGAGTTTGT ATTGGAAGCG CACGACTAAA CAGGGTGCGC TTGCGGGTAT GATCACGGGT GCGGCCACGG TGTTGATCTG GCATACGTTC ATCAAGCCAC TCGGTGGGGT ATTTGGTGTG TACGAGCTCC TTCCGGCCTT CGTTTTGGCA CTGGTGGCAC TTGTAGTGGT GTCGCTTGTT ACCGCTACTC CCGAGCAAAG TGTGCTTGAT GAATTTGATC GTTATGAAGC GCAGTTGAAG TAA
|
Protein sequence | MVKSDFWVLF AMLIYFIAVV IIGFVYAKRA NKSTEAYFLG GRSLGPWVTA LSAEASDMSG WLLMGIPGVA YFTGASEVLW TVIGLAIGTY LNWLLVARRL RVYSEVAGNA ITLPNFFSNR FHDHKNTLMT IAALIIIVFF SIYVGSCFVA VGKLFTTLFG LDYATMMVLG ALIVFLYTFV GGYLSVSTTD LVQGTIMICA LAIVFVGSVV QAGGVENTVA FLKEIPGFLS GTQTASPILD TNGLQQVVDS IPQFGDAKDY GLITIVSGLV WGLGYFGMPQ VLVRFMSARS ADDIKMSRRI ATVWCVVSMA CALCIGLVGR AVLPGVFLTQ SAAESIFIIL SQMILPSFIC GLVVSGIFAA SMSSSSSYLL ISGSAVAENI YRGLIRRDAT DTQVMIVARI TLVVVLLFGI VIALDQDSSI FQVVSYAWAG FGASFGPLML ASLYWKRTTK QGALAGMITG AATVLIWHTF IKPLGGVFGV YELLPAFVLA LVALVVVSLV TATPEQSVLD EFDRYEAQLK
|
| |