Gene Caul_1786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1786 
Symbol 
ID5899241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1887943 
End bp1889016 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content67% 
IMG OID641562276 
Productarsenical-resistance protein 
Protein accessionYP_001683413 
Protein GI167645750 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0798] Arsenite efflux pump ACR3 and related permeases 
TIGRFAM ID[TIGR00832] arsenical-resistance protein 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0258081 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCATCT TCGAACGCTA CCTGACCCTG TGGGTGGCGC TGTGCATCGT CGTCGGCGTG 
GCGCTGGGGC ATTTCTTCTC GCCGGCCTTT CATGCCGTCG CCGCCCTGGA GGTCGCCAAG
GTCAACCTGC CGATGGCCGG CCTGATCTGG CTGATGATCA TCCCGATGCT CATCAAGGTC
GATTTCGCGG CCCTGGGCGA GGTGCGCAAC CACTGGCGCG GCATCGGCGT GACGCTGTTC
ATCAACTGGG TGGTCAAGCC GTTCGGCATG GCGCTCCTGG GCTGGATCTT CATCGCCCAC
CTGTTTCGCC CCTGGCTGCC GGCCGAGCAG GTCGAAAGCT ATATCGCCGG CCTGATCCTG
CTGGCGGCGG CGCCCTGTAC GGCGATGGTG TTCGTCTGGA GCAACCTGAC CGACGGCGAG
CCCAACTTCA CCCTCAGCCA GGTGGCGCTG AACGATACGA TCATGGTCGT GGCCTTCGCC
CCGATCGTCG GCCTGCTGCT GGGCCTGTCG GCGATCACCG TGCCGTGGTC GACCCTGACC
CTGTCGGTGG GGCTCTATAT CCTCGTTCCG GTCCTGGCGG CGCAAGTGAT CCGCCGGGTC
CTGCTGGCGC GCGGTCCGCA GGCGCTGGCC AGCGTCCTGG CCAGGCTGCA GCCGCTGTCG
ATCGCGGCGC TGCTGGCGAC GTTGGTCCTG CTGTTCGGCT TCCAGGGCGA CCAGATCCTG
AAGCAGCCGC TGATCATCGC GATCCTCGCC GCGCCGATCC TGATCCAGGT CTATTTCAAC
GCCGGCCTGG CCTACATCCT CAATCGGATC ACCGGCGAAG CCCACTGCGT GGCTGGCCCT
TCGGCCCTGA TCGGGGCCAG CAACTTCTTC GAGCTGGCGG TGGCCGCCGC CATCAGCATC
TTCGGATTCC AATCCGGCGC GGCCCTGGCC ACCGTGGTCG GGGTGTTGAT CGAGGTGCCG
GTCATGTTGT CGATCGTCGC CATCGTGAAC GCCAGCAAGG CCTGGTACGA ACGCGGCGGC
GCCGTCCGCG CCGTGGCCGC CCGCCGCAAG ACCCTCTCTT CAAAGCCCCG GTGA
 
Protein sequence
MSIFERYLTL WVALCIVVGV ALGHFFSPAF HAVAALEVAK VNLPMAGLIW LMIIPMLIKV 
DFAALGEVRN HWRGIGVTLF INWVVKPFGM ALLGWIFIAH LFRPWLPAEQ VESYIAGLIL
LAAAPCTAMV FVWSNLTDGE PNFTLSQVAL NDTIMVVAFA PIVGLLLGLS AITVPWSTLT
LSVGLYILVP VLAAQVIRRV LLARGPQALA SVLARLQPLS IAALLATLVL LFGFQGDQIL
KQPLIIAILA APILIQVYFN AGLAYILNRI TGEAHCVAGP SALIGASNFF ELAVAAAISI
FGFQSGAALA TVVGVLIEVP VMLSIVAIVN ASKAWYERGG AVRAVAARRK TLSSKPR