Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0374 |
Symbol | |
ID | 5897648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 414507 |
End bp | 416390 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641560859 |
Product | amino acid/peptide transporter |
Protein accession | YP_001682009 |
Protein GI | 167644346 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCG TTGTCGCCGC CGGCATCCTG GTCACCCTCG TGACCGGCGT GCCCGTGCTT ATCCAACTGC TGCGGGGCCA TCCGCGCGGT CTGATCATTT GTTTCCTGGC CGAGATGTGG GAGCGGTTCT CCTACTACGG CATGCGCGGG CTGCTGATCT TCTATCTGAC CCAGCACTTC CTGTTCGATT CGAAGACGGC CGGCGGTCAC TACGGCTCCT ACACCTCGCT GGTCTATATC GTGCCGCTGC TCGGCGGCTT CCTGGCGGAC CGCTATCTGG GCACCCGCAA GGCCGTGGCG TTCGGGGCCA TACTGTTGGT GGCGGGCCAC CTGACCATGG CCGTCGAAGG CCGTCCGGCG ACCCAGACCC TGGACTATGC AGGCCAGACC TATGAGTTCC AGGTCAAGGG CCGCGGCGAG GAGCGGGTCG CCAAGATCAT CGTCGCCGAC AAGCCCTACG AGGTGGCCGC CAACGACAAG GGCGACTTCG AGATCAAGAA TCTGCCGGCC CAGTCGGCCA TTCCCTCGGT GCTGCCCAAG GGCCAGTACC AGCTGGGCGT CAAGGATCGC GATCCGCTCT ATCTGAACAT CTTCTGGCTG GCCCTGTCGT TGATCATCGT CGGCGTCGGC TTCATGAAGG CCAATGTCGC CACCCTGGTG GGCCAACTCT ATCCGCAGGG CGATCCGCGC CGCGACCCAG GGTTCACCCT CTATTATTAC GGCATCAATC TCGGCTCGTT CTGGGCCGCG ATCCTCTGCG GCCTGCTGGG GGTCAATGTG GGCTGGAACG CCGGGTTCGG CATGGCCGGC ATCGGCATGC TGGCCGGTTT CATCGTGTTC GTGCTGGGCA AGCCGCTGCT GCTGGGCAAG GGCGAGCCGC CCGAACCCAA GACGCTGAAG GCCCCGGTCG TCGGCCCGGT CAACCGCGAG GTCATCATCT ATGCCGGCTC GCTGGGCGTG GTCGGCGCGG TCTTCTTCCT GGTGCAATAT ACCCCGGTGG TCAGCGCCAC CCTGATCGCC GGCATGTTCG GCTCGCTGGG CTACATCCTG TGGTTCGCCT TCGTGAAATG CGAGAAGGTC GAGCGCGAGC GACTGCTGCT GGCCACGGTG CTGGTGCTGG GCGCGGTGGT GTTCTGGACC CTGTTCGAAC AGGCCGGCTC GTCGCTGAAC CTGTTCGCGG CCACCAACGT CAACCTGACC CTGCTGGCCA AGCCGGTGAC CTGGTTCAAT GGCGCGGTGA TCCTCGGCGC GCCCGAGCAA CTGCGGGCGG CGGGCATCGA CCCGGCCAGC GGCTTCTGGG TCAACACCTC GTTCAACGCC GCCCAGACCC AGGCCATCAA CGCCGGCTGG ATCCTGATCT TCGCGCCGTT GTTCGCGGCG ATGTGGACCT TCCTGGGGTT CCGCGGTCGC AATCCGGGGC CGATGGTCAA GTTCGGCCTG TCGCTGATCC AGGTGGGCGC GGGCTTCCTT GTCCTACTGA TCGGGGCGCA GTTCGCCGAC GGCGCGTTCC GCATGCCGCT CATCTTCCTG GTCGTCATGT ACATGCTGCA CACCTCGGGC GAGATGTTCA TGTCGCCGGT CGGGCTGTCG CAGATGACCA AGTTGTCGCC GCTATCGATC GTCTCGTTCG TGATGGCCGT CTGGTACATG GCCCTGGCCA TGGCCAACCT GTTCGGCGGT TGGATCGCGG GGATCGCCTC GACCGAGACC ATCGGCGGCC AGGTGCTGGA CCCGGCCGCG GCCATGGCCC AGTCGCTGCT GGTGTTCAAG ATCATCGGCC TGATCTCGAT CGGCATCGGC GTGCTGTTCC TGGCGCTGTC GCCGGTCCTC AAGAAGTGGT CGCACGGCTC CGACGACACC AACCCAGAAC CCGTCGCCCC TTAG
|
Protein sequence | MNIVVAAGIL VTLVTGVPVL IQLLRGHPRG LIICFLAEMW ERFSYYGMRG LLIFYLTQHF LFDSKTAGGH YGSYTSLVYI VPLLGGFLAD RYLGTRKAVA FGAILLVAGH LTMAVEGRPA TQTLDYAGQT YEFQVKGRGE ERVAKIIVAD KPYEVAANDK GDFEIKNLPA QSAIPSVLPK GQYQLGVKDR DPLYLNIFWL ALSLIIVGVG FMKANVATLV GQLYPQGDPR RDPGFTLYYY GINLGSFWAA ILCGLLGVNV GWNAGFGMAG IGMLAGFIVF VLGKPLLLGK GEPPEPKTLK APVVGPVNRE VIIYAGSLGV VGAVFFLVQY TPVVSATLIA GMFGSLGYIL WFAFVKCEKV ERERLLLATV LVLGAVVFWT LFEQAGSSLN LFAATNVNLT LLAKPVTWFN GAVILGAPEQ LRAAGIDPAS GFWVNTSFNA AQTQAINAGW ILIFAPLFAA MWTFLGFRGR NPGPMVKFGL SLIQVGAGFL VLLIGAQFAD GAFRMPLIFL VVMYMLHTSG EMFMSPVGLS QMTKLSPLSI VSFVMAVWYM ALAMANLFGG WIAGIASTET IGGQVLDPAA AMAQSLLVFK IIGLISIGIG VLFLALSPVL KKWSHGSDDT NPEPVAP
|
| |