Gene Caul_0374 details


Gene Information

Locus tag: Caul_0374
Symbol:
ID: 5897648
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 414507
End bp: 416390
Gene Length: 1884 bp
Protein Length: 627 aa
Translation table: 11
GC content: 65%
IMG OID: 641560859
Product: amino acid/peptide transporter
Protein accession: YP_001682009
Protein GI: 167644346
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3104] Dipeptide/tripeptide permease
TIGRFAM ID: [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial
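
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and do not belong to any database API.

```python
# Sketch: consistency checks for the coordinate and length fields.
# Assumptions: 1-based inclusive coordinates; CDS length includes the stop codon.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature spanning start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1

def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

length = gene_length(414507, 416390)
print(length)                  # 1884
print(protein_length(length))  # 627
```

Under these assumptions the reported gene length (1884 bp) and protein length (627 aa) agree with the start and end positions. `gc_percent` can be applied to the gene sequence in the Sequence section to compare against the reported GC content.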


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATATCG TTGTCGCCGC CGGCATCCTG GTCACCCTCG TGACCGGCGT GCCCGTGCTT 
ATCCAACTGC TGCGGGGCCA TCCGCGCGGT CTGATCATTT GTTTCCTGGC CGAGATGTGG
GAGCGGTTCT CCTACTACGG CATGCGCGGG CTGCTGATCT TCTATCTGAC CCAGCACTTC
CTGTTCGATT CGAAGACGGC CGGCGGTCAC TACGGCTCCT ACACCTCGCT GGTCTATATC
GTGCCGCTGC TCGGCGGCTT CCTGGCGGAC CGCTATCTGG GCACCCGCAA GGCCGTGGCG
TTCGGGGCCA TACTGTTGGT GGCGGGCCAC CTGACCATGG CCGTCGAAGG CCGTCCGGCG
ACCCAGACCC TGGACTATGC AGGCCAGACC TATGAGTTCC AGGTCAAGGG CCGCGGCGAG
GAGCGGGTCG CCAAGATCAT CGTCGCCGAC AAGCCCTACG AGGTGGCCGC CAACGACAAG
GGCGACTTCG AGATCAAGAA TCTGCCGGCC CAGTCGGCCA TTCCCTCGGT GCTGCCCAAG
GGCCAGTACC AGCTGGGCGT CAAGGATCGC GATCCGCTCT ATCTGAACAT CTTCTGGCTG
GCCCTGTCGT TGATCATCGT CGGCGTCGGC TTCATGAAGG CCAATGTCGC CACCCTGGTG
GGCCAACTCT ATCCGCAGGG CGATCCGCGC CGCGACCCAG GGTTCACCCT CTATTATTAC
GGCATCAATC TCGGCTCGTT CTGGGCCGCG ATCCTCTGCG GCCTGCTGGG GGTCAATGTG
GGCTGGAACG CCGGGTTCGG CATGGCCGGC ATCGGCATGC TGGCCGGTTT CATCGTGTTC
GTGCTGGGCA AGCCGCTGCT GCTGGGCAAG GGCGAGCCGC CCGAACCCAA GACGCTGAAG
GCCCCGGTCG TCGGCCCGGT CAACCGCGAG GTCATCATCT ATGCCGGCTC GCTGGGCGTG
GTCGGCGCGG TCTTCTTCCT GGTGCAATAT ACCCCGGTGG TCAGCGCCAC CCTGATCGCC
GGCATGTTCG GCTCGCTGGG CTACATCCTG TGGTTCGCCT TCGTGAAATG CGAGAAGGTC
GAGCGCGAGC GACTGCTGCT GGCCACGGTG CTGGTGCTGG GCGCGGTGGT GTTCTGGACC
CTGTTCGAAC AGGCCGGCTC GTCGCTGAAC CTGTTCGCGG CCACCAACGT CAACCTGACC
CTGCTGGCCA AGCCGGTGAC CTGGTTCAAT GGCGCGGTGA TCCTCGGCGC GCCCGAGCAA
CTGCGGGCGG CGGGCATCGA CCCGGCCAGC GGCTTCTGGG TCAACACCTC GTTCAACGCC
GCCCAGACCC AGGCCATCAA CGCCGGCTGG ATCCTGATCT TCGCGCCGTT GTTCGCGGCG
ATGTGGACCT TCCTGGGGTT CCGCGGTCGC AATCCGGGGC CGATGGTCAA GTTCGGCCTG
TCGCTGATCC AGGTGGGCGC GGGCTTCCTT GTCCTACTGA TCGGGGCGCA GTTCGCCGAC
GGCGCGTTCC GCATGCCGCT CATCTTCCTG GTCGTCATGT ACATGCTGCA CACCTCGGGC
GAGATGTTCA TGTCGCCGGT CGGGCTGTCG CAGATGACCA AGTTGTCGCC GCTATCGATC
GTCTCGTTCG TGATGGCCGT CTGGTACATG GCCCTGGCCA TGGCCAACCT GTTCGGCGGT
TGGATCGCGG GGATCGCCTC GACCGAGACC ATCGGCGGCC AGGTGCTGGA CCCGGCCGCG
GCCATGGCCC AGTCGCTGCT GGTGTTCAAG ATCATCGGCC TGATCTCGAT CGGCATCGGC
GTGCTGTTCC TGGCGCTGTC GCCGGTCCTC AAGAAGTGGT CGCACGGCTC CGACGACACC
AACCCAGAAC CCGTCGCCCC TTAG
 
Protein sequence
MNIVVAAGIL VTLVTGVPVL IQLLRGHPRG LIICFLAEMW ERFSYYGMRG LLIFYLTQHF 
LFDSKTAGGH YGSYTSLVYI VPLLGGFLAD RYLGTRKAVA FGAILLVAGH LTMAVEGRPA
TQTLDYAGQT YEFQVKGRGE ERVAKIIVAD KPYEVAANDK GDFEIKNLPA QSAIPSVLPK
GQYQLGVKDR DPLYLNIFWL ALSLIIVGVG FMKANVATLV GQLYPQGDPR RDPGFTLYYY
GINLGSFWAA ILCGLLGVNV GWNAGFGMAG IGMLAGFIVF VLGKPLLLGK GEPPEPKTLK
APVVGPVNRE VIIYAGSLGV VGAVFFLVQY TPVVSATLIA GMFGSLGYIL WFAFVKCEKV
ERERLLLATV LVLGAVVFWT LFEQAGSSLN LFAATNVNLT LLAKPVTWFN GAVILGAPEQ
LRAAGIDPAS GFWVNTSFNA AQTQAINAGW ILIFAPLFAA MWTFLGFRGR NPGPMVKFGL
SLIQVGAGFL VLLIGAQFAD GAFRMPLIFL VVMYMLHTSG EMFMSPVGLS QMTKLSPLSI
VSFVMAVWYM ALAMANLFGG WIAGIASTET IGGQVLDPAA AMAQSLLVFK IIGLISIGIG
VLFLALSPVL KKWSHGSDDT NPEPVAP
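
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates that relationship, assuming the amino acid assignments of the standard genetic code (translation table 11 uses the same assignments and differs in which codons may act as start codons). The `translate` helper is hypothetical and written only for this illustration.

```python
# Sketch: translate a nucleotide string codon by codon.
# Assumption: standard amino acid assignments, which table 11 shares.

BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)

# Build the codon table from the compact strings above.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(seq: str) -> str:
    """Translate from the first base until a stop codon or the end of input.

    Whitespace in the input (such as the 10-base grouping used in this
    record) is ignored. A trailing partial codon is dropped.
    """
    dna = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)

# First 30 bases of the gene sequence
print(translate("ATGAATATCG TTGTCGCCGC CGGCATCCTG"))  # MNIVVAAGIL
```

The first ten codons of the gene give MNIVVAAGIL, which matches the start of the protein sequence; the final codons (AACCCAGAAC CCGTCGCCCC TTAG) give NPEPVAP followed by the TAG stop codon.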